FIGURE l 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGG TC TGAAGACTAAC AT T T TGTGAAGT T GTAAAACAGAAAACC TGT TAGAAATGTGGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACTGCAGTAACAC T CCACCATATAGACCCGGCTT TACC T TATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
P TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTG6ATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAAT T TT GGGACTGATT TAGAACAGAAACTCCATTGGAACCCCGAGG 

y i 

% ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 

p| GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 

is T GGAT TAACCCT CT AT GACAC TGCACCT T GCC CTATTAACAAT GAACGAACACGGCTACT T TCCA 

U* GAGATATTTGAT GAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGAT TGGGGAAAGG 

III 

J; TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 

Ss GAT GAAT GC TGATAAT CAGGAAACATGAAAGAAGCCAT TT GATAGATTATT CTAAAGGAT AT CAT 

m 

P CAAGAAGAC TAT TAAAAACACCT AT GCC TATACT TT T T TATCTCAGAAAATAAAGTCAAAAGACT 

p| ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLS FLPSALVI WTSAAFI FSYI TAVTLHHI DPALPYI SDTGTVAPEKCLFGi^MLNIAAV 
LCIATIYVRYKQVHALSPEENVI IKLNKAGLVLGILSCLGLS IVAKFQKTTLFAAHVSGAVLTFG 
MGSLYMBYQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE 3 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGA.ee 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
C T T AC AC T T CGCCATGAG T T T CCT CATC GAC T C CAG CAT CAT GAT T ACC T C C CAGATAC T ATT T T TT G 
GAT TTGGGTGGCTTTT CTTCATGCGCCAATTGT TTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TT T TCAT GGTGCCTTTTTACATTGGCTAT TTTATTGT GAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CAT TCT CAGCCCAAAACATGGGATCTTATCCATAGAACAGCT CATCAGCCGGGTTGGTGT GATT GGAG 

ttf TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

til 

Lb CTCAGGAATGT GACTGACAC GGATATTCT AGCCCTGGAACGGCGACT GCTGCAAACCAT GGATATGAT 

fell 

1 1| CATAAGCAA?y\AGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 

B jK AAC CAT CAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGT GAAAATCT TACT 

111 CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 

$ 



AT AT GC T ACCAAGGAGAGAAT AGAATAC T CCAAAACC T T CAAGGG GAAAT AT TTT AATT T T CTT GGT T 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGATCCTGTCACAAGAGGCAT TGAGATCACT GT GAATTAT C TGGGAATCCAATTTGATGT 
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 



rj TGC T GAT CACT C T TAC CAAGTTC T T TT AT GCC ATCT C T AGC AGT AAGT CCT CCAAT GTCAT TGT CCT G 

IH CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATAC CGCACCAT AATCACTGAAGT CC TT G GAGAAC T G C AGTT CAAC TT CT AT C ACC GT T GG T TT G 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACC^GGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG, 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CC CGT GTGGATAT GAGGCTGG 
TGT AGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAAC T CTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 
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MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFSLIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFIYrV^ 

FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TWYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLLIRMSMPLEYRTI ITEVLGELQFNFYHRWFDVI FLVSALSS ILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 



||| N-glycosylation sites. 



amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCAT CCGGACTAG T TAT TGAGCATC TGC C T CT CATATCACCAGTGGCCATCT GAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTAT GGGGAT CACCCT TGTGAGCAJ\AAAGGCGAACCAGCAGCTGAAT T T CACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT T ATCACCACCAAAGATCCCAT AT TCAACAC T CAAACTGCAACACAAACAACAGAAT T TATTGT 
r fc CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CI CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
rj| GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA 
P AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
W CAGCAGAAGGAAAT GATCGAAACCAAAG TAGTAAAGGAGGAGAAGGC CAAT GATAGCAACCCTAA 

III TGAGGAATCAAAGAAAAC TGATAAAAACCCAGAAGAG T CCAAGAGTCCAAGCAAAACTACCGTGC 

& |; GATGCCTGGAAGCTGAAGT TTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTT TCT T T 

m CAT GCTCC T TAC CCTGCCCCAGCT GGGGAAATCAAAAGGGCCAAAGAAC CAAAGAAGAAAGT CCA 

y * CCCTTGGT TCCTAACT GGAATCAGC T CAGGACTGCCAT T GGAC TATGGAG T GCACCAAAGAGAAT 

GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
El TTC TAGCC TGGC TATGTCCTAAT AATATCCCAC TGGGAGAAAGGAGT T T T GCAAAGTGCAAGGAC 

||| CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
|i| AAAGCC AAGGAGT C AC T GAGACC AAG GC T T T C T C TAC T GAT T C C GCAGC T C AGAC CCTTTCTT CA 

P GCT CTGAAAGAGAAACACGTATCCCACCTGACATGTCCTT CTGAGCCCGGTAAGAGCAAAAGAAT 

W GGCAGAAAAGTTTAGCCCCTGAM.GCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA. 
| s | GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
III ACAGACAGGGTCAAAGTGT TT TCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAACTCTTATAAAT T TCTATT TT TATCTGAGTTACAGAAATGATTACTAAGGAAGAT T 
ACTCAGTAATT TGTT TAAAAAGTAATAAAATTCAACAAACATT TGCTGAATAGCTACTATAT GT C 
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TAT TAACATAC CTAAGAAGT ACAT T GT TACCTC TATATACCAAAGCACAT TT TAAAAG T GC C 
ATT AACAAATGT AT CACTAGC CCTC CT T TT TCCAACAAGAAGGGAC T GAGAGATGCAGAAAT AT T 
TGT GACAAAAAATTAAAGCAT TTAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LS LAGKDQVSTALKAS FETCS YGWVGDGFWI SRI SPNPKCGKNGVGVL IWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVF^4ETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVIJU J LFFGAAAGLGFCWK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site* 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGT GAACC TGGCAAACTTACCTCCCAGC TATCACAATGAGACCAACACAGAC 
ACGAAGGT TGGAAATAATACCAT CCAT G T GCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
y s T GGACAAATGGT CT TTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC 

ACGAGT GCAT CATC GACGAGGAC TGTGGGCCCAGCAT GTAC TGCCAGT TTGCCAGC T TCCAGTAC 
Hi ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
If GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
&* ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ill ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
III CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
4; CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
01 GAGATCCTGC TGCCCAGAGAGGTCCCCGATGAGTAT GAAGT TGGCAGCTTCATGGAGGAGGT GCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
CI CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
III GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
IJI TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
||| TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTG T TAGGAACAGCAG TGT T CT CACAGTG TGGGGCAGCCG T CCT TCTAATGAAGACAAT GAT 
AT T GACAC TGTCCCT CTTTGGCAGT TGCAT TAGTAAC TT TGAAAGGT AT AT GAC TGAGCGTAGCA 
T ACAGGT TAACC TGCAGAAACAGTAC T TAGGTAAT TG T AGGGCGAGGAT TATAAAT GAAATT TGC 
AAAATCAC T TAGCAGCAAC TGAAGACAAT TAT CAACCACGTGGAGAAAAT CAAACC GAGCAGGGC 
T GTGTGAAACAT GGT TGTAATAT GCGAC T GCGAACACTGAACT C TAG GCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CAT GAT TGTATAAGCATGCT TT CT TT GAGT T T TAAAT TAT GTATAAACATAAG T TGCAT T TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS SEVNLANLPPS YHNETNTDTKVGNNT IHVHRE IHKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDKQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RS LTEEMALGE PAAAAAALLGGEE I 



Cl Signal sequence: 

amino acids 1-19 

If! 



CI! 



y! 
B 
ry 



N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 



M N-myristoylation site. 

Nt amino acids 202-208, 217-223 



Amidation site. 

amino acids 140-144 
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FIGURE Q 

CGGA CGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTT^ATCCAACGTTCTGTCTTGAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAG TGCTGCCT C T GGTGTC T GGAAAAAT TTATCAAGT TCC TAAACC GCAATGCATACAT CAT GA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT TCTGGGCAAGAAGAACGAGGC GCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GAT CGCGCCACT GCACT CCAACCT GGGT GACAGACT CT GT C TC CAAAACAAAACAAACAAACAAA 
AAGAT TTTAT TAAAGATAT TT TGT TAAC T C 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 

NWVLALGQCVLAGAFAS FYWAFHKPQD I PT FPL I S AFIRTLRYHTGSLAFGAL I LTLVQI ARVT L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IWVVVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites, 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 27 6-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
T T T GT CAC CTGGT CAG C CC T ATCCAGTAT CC CT GAAC AGAAAT GCAACC CC CATT T G CCAAC CCAGC T 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCT.TCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCAS CLCGSAPC I LCS CCPASRNS TVSRL I FT FFL FLGVLVS I IMLS PGVES QL 
YKLPWVCEEGAGI PTVLQGHI DCGS LLGYRAVYRMC FATAAFFFFFFTLLMLCVS S SRDPRAAI Q 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNSGLLQASVI TLYTMFVTWSALSS IPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPS I 
VGLI IFLLCTLFI SLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
S FFHFCLVLASLHVMMTLTNWYKPGETRKMI S TWTAVWVKI CASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GG TTGGAAAAAGAC T CCTGTAACCC TCC TCCAGGATGAAC CACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCC^GAGCTCCCATGCTTCTCTGCGCAATATCCATTCC^TC^CCCCACACAA 
CTCATGGCCAGGAT TGAGTCC TATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGGAGGCAT T GAGAACACAT TAGAGAAGGAGGTGAT GCAGTAT GACT ACTAT TC TT CATAT 
TTT GATATAT T T CT T CTGGCAGTT T T T CGAT TTAAAG T G T TAATAC T TGCATATGC TG TGTGCAG 
Kl ACT GCGCCAT TGGT GGGCAATAGCGTT GACAACGGCAGT GACCAGTGCCT T TT TAC TAGCAAAAG 

r.i 

%*l TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 

ill CTTGCCTGGAT TGAGACGTGGTTCCTGGAT T TCAAAGTGT TACCTCAAGAAGCAGAAGAAGAAAA 

111 CAGACTCC TGATAGT TCAGGATGC T T CAGAGAGGGCAGCACT TATACC T GGTGG TC T T TCT GAT G 

*fi GTCAGTTTTAT TCCCCTCCTGAATCCGAAGCAGGAT CTGAAGAAGCTGAAGAAAAACAGGACAGT 



G AGAAACC AC T T TT AGAAC T ATGAG TAC TAC T T T T G T T AAAT G T GAAAAAC CC T CACAGAAAG T C 
AT C GAGGCAAAAAGAGGCAGGCAG TGGAGT CTCCCT GT CGACAGTAAAGT T GAAATGGTGACGT C 



in C AC T GC T G GC T T TAT T GAACAGC T AATAAAGAT T TAT T TAT T G TAAT ACC T CACAAAC G T T G TAC 

fjj CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
III TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 



AAAAGACT TAAT ATATTGAAGTAACACT T T T TTAGTAAGCAAGATAC CT T T TTAT TTCAAT T CAC 
AGAATGGAATT TTTTTGTT TCAT GT CT CAGAT T TAT T TT GTAT TTC T TT TT TAACACTCT ACAT T 
T CCC T T GTTTT T TAACTCATGCACATGTGCTC T TT GTACAG T T T TAAAAAGTGTAATAAAAT CT G 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATT GCGAATACATGTAAAATGTCACCAGACATT TGTATTATT T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAAT GAC T TAAACCAT TCAT AT CATGT TTCCT T T GCGTTCAGCCAAT T TCAAT TAAAAT GAA 
CTAAATTAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTL LW 1 1 ELNVNGG I ENT LEKEVMQYDY YS S YFD I FLLAVFR FKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLIJ^CVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myri s toyl a tion si te s . 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE is 



F' 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
Wl GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
I*! TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
ft CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
^ TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
g TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
?j TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
ff| CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
^ TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
M : ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
Ifl GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
f| GGCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
p"J AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
Etl GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
fll GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGC7UVCGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
T CCAGGCGGACCCT GGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
G GC T G TGT GAC T C T AG TC T T GGC CC CAG GAAG C GAAG GAAC AAAAGAAAC T GGAAAGGAAGAT G C 
TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT 
GGGAAGATGTT T TTCAAACTCAGAGACAAGGACTTTGGT TTTTGTAAGACAAACGATGATAT GAA 
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVT QLRPNAT YSVCVMPLGPGRVPE GEEACGEAHT PPAVHSNHAPVT QAREGNLPLL I APALAAV 
LLAALAAVGAAYCWRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATTCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGATGAATC CT TAGAT T CCAAGAC TACT TTGACAT CAGATGAGT CAGTA 
AAGGAC CATACT ACTGCAGGCAGAGTAG TTGC TGGT CAAATAT T TCT TGATT CAGAAGAATCTGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCTT T CTAGAGT CTCCAAATCCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGAT GGCAGAC TGTGGTGTG 
CTACAACC TAT GAC TACAAAGCAGATGAAAAG T GGGGCT T TTG TGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGAT GCAGGAAGCAGAAATGATGTAT CAAACTGGAATGAAAATCCTTAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTAT CT CCAAAAGGCAGCAAGCATGAACCAT A 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
AAT AAT AT TAAC AT CAGAAGAAT TTGTGGTT TAT AGC GG C CACAAC T T T T T C AGC T T T CAT GAT C 
CAGAT TTGCT T GTAT T AAGAC CAAATAT TCAGT TGAACT T CCT T CAAAT T C TTGT TAATGGATAT 
AACACATGGAAT CTACAT GTAAAT GAAAGTTGGTGGAGT CCACAAT T TT T CTT TAAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT TGTCAGAATCATT T T TTACAT TAGAT TAT CAT AAT T T TAAAAATT T T TCT T TAG T TT T TCA 
AAAT T TTGTAAATGG T GGCTATAGAAAAACAACATGAAATAT TATAC AATAT T T T GCAACAATGC 
CC T AAGAATT G T TAAAAT TCAT GGAG TTAT TTGTGCAGAATGACTCCAGAGAGC T C TACT T TCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAACTCATTT TTAATAAAATTATGTCTAAGAT TAAAAAAZU\AAAAAAAAAAAAAAAAAAA 
AAAAAAAAAA?UU\AAAAAAMAAAAAAAAAAAAAAAA?yy\AAAAAAA 



FIGURE 18 



MRVRIGLTLLLCAVLLSIASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMytYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites, 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class- II protein. 

amino acids 101-119 
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FIGURE 10 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACAT T T T T ATCAC TGGATGTGACTCGGGC T T TGGAAACTTGGCAGCCAGAAC TT T T GAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
C CC GGCGTGC TGGCTC C CACTGACT GGCTGACAC TAGAGGAC TACAGAGAACC TAT TGAAG T GAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TAT GCAGTGGAAGGTT TCAATGACAGCT TAAGACGGGACATGAAAGC T T T T GGTGTGCAGG TCT C 
ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
T CGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGT TACATTGAAAAA 
AGT C TAGACAAACT GAAAGGCAATAAATCCTATGTGAACATGGACCT CT CT CCGGTGGTAGAGT G 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
G CAG AG C T GG C T AAT CC CAAGGC AG T GTGA C T CAGC TAAC C ACAAAT G T C T CC T CCAG GC TAT GA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

MLB^VLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSSRLRTVLLDVTDPENVKR^ 

REPIEWLFGLISVTLNMLPLVKKAQGRVIOTSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKIAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIF^ 

Important features of* the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GG T GAAGCCAAGAACAGCAT TACTGAT T CCCAAATGGAT GAT G T T GAAGT T GT T TATACAATTGA 
CATTCAGAAATATATT CCATGCTAT CAGCTT TTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATAT TATCAAAT G TCAAAAAGAATGTGGTAGGT T GGTACAAAT T CCGT 
CGT CAT TCAGATCAGATCATGACGT TTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATT T 
TTCAAACCAAGACC TT G T TT T TCT GCTAT TAACACCAAGT ATAATAACAGAAAGCTGC TCTACT C 
ATCGACTGGAACAT TCC T TATATAAACCTCAAAAAGGACT TT T TCACAGGGTACCTT TAGTGGT T 
Pi GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
ffl T T T TAGCC GAGCAGTACAAACACACAGC TC TAAAT T T TT TGAAGAAGATGGATCCT TAAAGGAGG 

ill T ACATAAGATAAAT GAAATGTAT GCT T CAT T ACAAGAGGAAT TAAAGAGT ATAT GC AAAAAAGT G 

I ft 

l % GAAGACAG TGAACAAGCAG TAGATAAACTAGTAAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 

g|J GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 

R ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 

|3 ATGTCT T TAAAAAATAGACATGT T T C T AAAAGTAGCT GTAACTACAACCAC CATCTCGAT GTAGT 

tfl AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGT^ 

!!? AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA 

f P 

GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
H 1 GAGCAGCCCAGAAACAGATGAAGAAAT T GAAAAGAT GAAGGGT T T T GG TGAATAT TCACGG TCT C 

CTACAT T T TGATCC T TT TAACC T TACAAGGAGAT TT T TT TATT TGGCTGAT GGG TAAAGCCAAAC 
AT TT CTAT T GT T TT TACTATGT T GAGCTAC T TGCAGTAAGT TCATT T GTT T T TACTATGT T CAC C 
T GTT TGCAGTAATACACAGATAACT CT TAG TGCATT TACT TCACAAAG T ACT TT T TCAAACATCA 
GAT GC TT T TAT T TC CAAACCTT TT T T TCAC CT TTCACTAAGT T GTTGAGGGGAAGGC T TACACAG 
ACACATTCT TTAGAAT TGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
T AGGGAAGACAAGT CAGGAGGAT TGAT TGAAGC TAGGAGT TAGAGACCAGCCTGGGCAACG TAT T 
GAGACCATGTCTAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCC TTATTT TCAAAATATGGAAA 
GAAAT T TAT AT GAAAAT T T ATC T GAGT CAT TAAAAT TCTCC T T AAGT GAT AC T T T T T T AGAAG T A 
CAT TAT GGC T AGAGT T GCCAGAT AAAAT GCT GGAT AT CAT GCAAT AAAT T T GC AAAAC AT CAT C T 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVl^QALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QI QAAREKN I QKDPQENI FLCQALRT FFPNS E FLHSCVMSLKNRHVSKS SCNYNHHLDWDNLT L 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



III 



GGCACAGCC GCGCGGCGGAGGGCAGAGT CAGC C GAGC CGAGT CCAGCC GGACGAGCGGACCAGC GCAGGGCAGC CCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
rj| CAAGAGAT GGAGGAGAAGT ACAGCCT GCAAAGTAAAGATGATT T CAAAAGTGTGGTCAGC GAACAGTGCAAT CATTT G 

CAAGCTGT CTTT GCTT CACGTTACAAGAAGTTT GATGAAT T CTT CAAAGAACTACTT GAAAATGCAGAGAAAT CCCT G 
AATGATAT GTTT GTGAAGACAT AT GGCCATTTAT ACATGCAAAATT CT GAGCTATTT AAAGATCT CTT CGTAGAGTT G 
L|| AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
TT CCGCCT GGT GAACT CCCAGTACCACTTTACAGAT GAGTAT CT GGAATGT GTGAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TT AGC GGT T GCGGGAGAT GTCGT GAGCAAGGTCTCCGT GGTAAACCCCACAGC CCAGT GTACCCAT GCCCT GTTGAAG 
p AT GATCTACTGCTCCCACTGCCGGGGTCTCGTGACT GTGAAGCCATGTTACAACTACTGCT CAAACATCATGAGAGGC 

IJI TGTTT GGCCAAC CAAGGGGATCT CGATTTT GAAT GGAACAATTTCAT AGAT GCT ATGCTGAT GGT GGCAGAGAGGCTA 

111 GAG GGTCCTT TC AACATTGAAT C GGT CATGGAT CCCATCGATGTGAAGATTTC T GAT GCTATTAT GAACAT GCAGGAT 

f*J AATAGT GTTCAAGTGT CT CAGAAGGT TTTC CAGGGAT GTGGACCCCC CAAGCCCCTC CCAGCTGGACGAATTTCT C GT 

I'll T CCAT CTCT GAAAGT GCCTTCAGTGCTCGCTT CAGACCACATCACCC C GAGGAAC G CCCAACCACAG CAGCT GGCACT 

| J AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
jffj GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CT GTTTGCAGT GACAGGAAATGGATT AGCCAACCAGGGCAACAACCCAGAGGTCCAGGTT GACACCAGCAAACCAGAC 
AT ACT GAT CCTT CGT CAAAT CAT GGCTCTT CGAGTGATGACCAGCAAGAT GAAGAAT GCAT ACAATGGGAAC GAC GTG 
GACTT CTTTGATATCAGTGATGAAAGTAGT GGAGAAGGAAGTGGAAGT GG CTGT GAGTAT CAGCAGT GCCCT TCAGAG 
TTTGACTACAAT GCCACT GACCATGCT GGGAAGAGT GC CAAT GAGAAAGC CGACAGT GCTGGT GT CC GTCCT GGGGCA 
CAGGCCTACCTCCTCACT GTCTT CT G CATCT T GTT CCTGGT TATGCAGAGAGAGT GGAG ATAA TT CT CAAACT CT GAG 
AAAAAGT GTTCATCAAAAAGTTAAAAGGCACCAGTTAT CACTT TTCT ACCATCCT AGT GACTT T GCT TTT TAAAT GAA 
T GGACAACAATGTACAGTTT TTACTAT GT GGCCACT G GTTTAAGAAGT GCT GACT TT GTTTT CT CATT CAGTT TT GGG 
AGGAAAAGGGACTGTGC^TTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAAC^GTGTAGGTAC^GAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTT TTT T C CAACT GT GATCTC GC CTT GTTT CTT ACAAGCAAACCAGGGTCC CTTCTT GGCACGTAACATGT ACGT AT T 
T C TGAAAT ATTAAATAGCT GTACAGAAGCAGGTTTT ATTT ATC AT GT TAT CTTAT TAAAAGAAAAAGC C CAAAAAGC 



App_ID=l 0063546 



Page 173 of 3 18 



FIGURE 24. 



MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAWASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELiO^YYWGOTNLEEM^ 

QLKPFGDVPRKLKLQVTRAFVAART FAQGLAVAGDWSKVSWNPTAQCTHALLKMI YCSHCRGL 
VT VKPC YNYCSN IMRGCLANQGDLDFEWNNF I DAMLMVAERLEGP FN I E S VMDP IDVKI S DAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWS SLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 

In KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 

g| KSANEKADSAGVRPGAQAYLL TVFC I LFLVMQREWR 

ill 
HI 

y. =■ 
' r» 
?p 

p| amino acids 1-22 



Important features: 
Signal peptide: 



ATP/GTP-binding site motif A (P-loop) 

amino acids 515-524 



m 

PI N-glycosylation site. 

I'll amino acids 514-518 



Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
T CAAGGGCAAT GT GAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT T CCAGA 

L % GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 

1*1 

AGCGCCCACTCT T C CAAT TAAACATTCTCAGCCAAGAAGACAGT GAGCACACC TACCAGACACT C 

fit* 

£11 TTCTTCTCCCACCTCACTCTCCCACTGTACCC^ 

III TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 

i'ii 

GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGT GTCAT T TAACCT TAAATGCAATCAGGAAAG T AGCAAACAGAAGTCAATAAATAT T TT T 
AAATGTCAAAAAAAAAAAAAAAAAA 



m 
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FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites* 

amino acids 27-33, 46-52 
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FIGURE 27 

GGACGCCAGCGCC T GCAGAGGCT GAGCAGGGAAAAAGC CAGTGCC CCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGT TCC TGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTT TGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
GAGCCCACCT GGAAACACAT TGGGGATGGC TGCTGC CT CACCAGAGAGACCTGGAAGGATCTT GA 
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACTCAAAGAGGCAGAGGT T T TG TTCTCAAAT AT T T T T TAATAAATAGACGAA 
ACCACG 



FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQE 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 



:! : 

hi 



Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 



s N-myristoylation sites. 

CI amino acids 64-70, 78-84, 80-86, 91-97, 201-207 

ill 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGC TGGGAGATAGGGAACAGAAGAGGGTAGT GGGTGGGC TAGGGGGGCTGCC T TAT TT AAA 
GTGG T TGT TTAT GAT T C T TATACTAATT TATACAAAGATAT TAAGGCCC TGT T CAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 3Q 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 5U 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATC ATG TCGG 
GAAGAGATACAAT C CTTGGCC TGT GTATCCT CGCAT TAGCCT T GTC T TT GGCCAT GATGT T TAC C 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTT GAT T T T TG T TC TCAGAAAGAGAATAAAAT TGACAGT TGAGCT T T TCCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
||l TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
III ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
"'ll AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
J|I TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
g TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
0 GAT TCCGAGAAT CAT TGTCATGT ACAT GCAAAACGCACT GAAAGAACAGCAGCATGGTGCATTGT 

CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AAC CAGAATGCATATACTACAAC T GCT AT TAATGGGACAGAT T TCTGTACAT CAGCAAAAGATGC 
ATT GAAAATC T TGT CCAAGAAC T CAAGTCACT T TACATCTAT TAAC T GCT T TGGAGAC T T CATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAAT GAT GGAT CGTCAGAAAAGC CC TACT T TAT GGATCAAGAAT T T C T GAG T T TCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAAC T CCAGGCCATTGTGAGATAGATACC CATT TAGGTAT CT GT ACC T GGAAAACAT T 
TCCT TCTAAGAGCCAT TTACAGAATAGAAGATGAGACCACTAGAGAAAAGTTAGTGAAT TT TTTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MS GRDT I L GLC I LALAL S LAMM FT FR FI T T LL VH I FI S LV I LGLL FVCGVL WWL Y YDY TNDLS I E 
LDTERSNMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWAVLLSLGTAGAAQYMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAIKGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLETNDGSSEKPYFMDQEFLS FVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

b 

p| Important features: 

fit Signal peptide: 

til amino acids 1-20 

Putative transmembrane domains: 

* amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

I*! 

l?i N-glycosylation sites. 

If! amino acids 204-208, 295-299, 313-317 

III 

||| N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 

ATAAT TT G TATGGGATT T G TGAT GCAGGAAAGCCTAAGGGAAAAAGAATAT TCATTC TGT GT GGT 

GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 

TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 

AT T CAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGT T CACT GT GCCTCAGATCAACT GC 

GAT G TCAAAGCCGGAAAGAT CATCGATCC TGAGTT CAT T GTGAAAT GTCCAGCAGGATGCCAAGA 

CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 

TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 

GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 

TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 

CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 

ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 

CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 

GCCACAGGAGCC AGGAGATGGAT C T CTGGTCCACTGCCAC C TACACAAGCAGCCAAAACAGGCC C 

AGAGCTGATCCAGGTATCCAAAGGCAAGAT CC T TCAGGAGC TGCCT T CCAGAAACCTGTT GGAGC 

GGATGTCAGCCTGGGAC T TGTTCCAAAAGAAGAAT T GAGCACACAGT CTT T GGAGCCAGT AT CC C 

T GGGAGATCCAAAC TGCAAAAT TGACTT GTCG T TT T TAAT TGAT GGGAGCACCAGCATTGGCAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 

C GGT C CAC TGAT GGG T G T T GT CC AGT AT GGAGAC AACC C T G C T AC T C AC T T TAACC T CAAGAC AC 

ACACGAAT TC TCGAGATCTGAAGACAGCCATAGAGAAAAT TAC T CAGAGAGGAGGACT T TC TAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 

GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 

ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 

GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 

AGAGT T T GAGAT TT CCGACACGGACACGCGCATCGGGGCCG TGCAGTACACCTACGAACAGCGGC 

T GGAGTT TGGG T TCGACAAGTACAGCAGCAAGCC TGACAT C CTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

GCC CAAGAGGAGC T AGAAGTCAT TGCCACTCACCCC GCCAGAGACCACTCCT TC T T TG TGGACGA 

GT T TGACAACC T CCAT CAG TATGTC CCCAGGAT CAT CC AGAACAT T T GT AC AGAGT TCAACT CAC 

AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 

GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 

AAATGTCT TGT TAT TATTC TT TGCCAT CAT GC T T TT T CATATT CCAAAAC T TGGAG T TACAAAGA 

T GATCACAAACGTATAGAATGAGCCAAAAGGC TACATCAT GTTGAGGG T GC T GGAGAT T T TACAT 

T TTGACAATTGTTTTCAAAATAAAT GT TCGGAATACAGTGCAGCCCT TACGACAGGCTTACGTAG 

AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

T TTTGTCATGACAATGTAGGAAT TGCTGAATTAAAT GT TT AGAAGGATGAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAASAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAZU\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



m 



& 
m 
m 
m 



FIGURE 34 



MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKI I DPE FIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ES FI VLE SKPKKGVTYPSALTYSS SKSPAAQAGETTKAYQRPP I PGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVWViWDGWPTDKVEEASRIARESGINIFFIT^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 



p LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
|p FKKSKPNKRKLMI L I TDGRS YDDVRI PAMAAHLKGVI TYAI GVAWAAQEELEVI ATHPARDHS FF 



VDE FDNLHQYVPRI I QNICTEFNSQPRN 



III Important features: 

g Signal peptide: 

tli amino acids 1-2 6 



Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 



N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 



Amidation site. 

amino acids 304-308 
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FIGURE as 



CCGAGC AC AGGAGAT TGCCTGCGT TT AGGAGGTGGCTGCGT T GTGGGAAAAGCT AT CAAGGAAG AAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACi^ACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
AC AGAAAAC AAC AAAAAACTTAAGCT TT AATT TCAT CT GGAATTCCACAGT T TTCTT AGCTCCC TGGACC C 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 

GGATGT CACTGAGATCCCTCAAAT GGAGCCTCCTGCTGCT GTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 



P TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 

P TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 

EM AT TGTC CTTAGAGGAT GAAC ACCT TCT TT ATGGTGACATAAT CCGACAAGAT TTTT TAGACACAT ATAAT A 

*g ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 

"J: AAGACAGAC ACTGATGTT TTC ATC AATACTGGCAAT TT AGTG AAGT AT CT TTT AAACCTAAACC ACTC AGA 

j| J GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 
CTTACC AGGAGT AT CCTT TC AAGGTGTTCCCTCC AT ACTGC AGT GGGT TGGGT T AT AT AATGTC C AGAGAT 

Sr 

f| TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 

||| CTGT TT GAAT TT AT T AAAAGTG AACAT T C ATATTCC AGAAGACAC AAATCTTTTCT T TCT AT AT AG AATCC 

Q ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 

I'll TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 



W GATACC TTGT GGAAAGTGTT AAAT AAAGT AGG TACT GT GGAAAATTCATGGGGAGGTC AGTGTGCTGGCT T 

AC ACTGAACT GAAACT C ATGAAAAAC CCAG AC TGGAGACTGG AGGGT T ACACTTGTGAT T TAT T AGTC AGG 
CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 
ACCAAAC AATTT GGACAT GTC ATT CT GT AG ACTAGAAT TT CTTAAAAGGGT GT T ACTGAGTT AT AAGCT C A 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TATCTT ATGT GG AT T ACCAATT T AAAAAT AT ATGTAGT TCTGTGTCAAAAAACTTCT TCACT GAAGTT AT A 
CT GAACAAAATTTT ACCT GTTTTT GGTCATTTATAAAGTACTTCAAGATGTTGCAGTATT TCACAGTT ATT 
AT TATT T AAAATT ACT TC AACTTT GT GT TT TT AAAT GT T T TG ACGATT TC AAT ACAAG AT AAAAAGGATAG 
TGAATCATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCACTCCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAG AAT T AAAGCAAGAAAATCTG AAAA 



AppJD=10063546 



Page 185 of 318 



FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLLS 

TLREHSNCSHQNPFLVILVTSHPSDVKM^QAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYI^LTLKTIl^AFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP I KFEDVYVG I CLNLLKVN I H I PE DTNL FFL YRI HLDVCQLRRVTAAHG FS S KE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site, 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 27 



CGCTCGSGCJ^CCAGCCGCGGCARG GATGG AGCTGGGTTGCTGGACGCAGTTGGGGCTCftCTTTTCTTCAGCTCCTTCTCATC 
TCGTC CT TGCCAAGAGAGTAC AC AGTCAT TAATGAAGCC T GCCCTGGAGCAGAGTGGAAT ATCATGTGTCGGGAGT GCTGTG 
AATATGATCAGAT TGAGTGCGTC TGCCCCGGAAAGAGGGAAGTCGTGGGTT AT ACCATCCC T TGC TGCAGGAATGAGGAGAA 
TGAGTGT GACT C CTGC CTGATCC ACCCAGGTTGTACCATC TT TGAAAAC TGCAAGAGC TGC CGAAATGGCTCATGGGGGGGT 
ACC TTGGATGAC TTCT ATGT GAAGGGGTTCTAC TGTGCAGAGTGCCGAGCAGGCT GGTAC GGAGGAGACTGCATGC GATGT G 
GCCAG GTTCTGCGAGC CCCAAAGGGTCAGAT T T TGTTGGAAAGCTATC CCC TAAAT GCTCAC TGTGAATGGACCAT T CATGC 
TAAAC CTGGGT T TGTC ATCC AAC T AAGATT TGTCATGTTGAGTC T GGAGTT TGAC TACATGTGCCAGTATGACTATGTTGAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATCAT CAAGCGTGT CTGT GGC AACGAGCGGCCAGCTCCTATCCAGAGCATAG 
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCA 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATACTGGGCAGCGC TGTGAAAATC TCCT TGAAGAAAGAAACTGCTC AGACC C TGGGGGCCCAGTCAATGGGTACC AGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
TCTTAGTGGCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGG 

GAACCAAAGAT T TCAGACC T GGTGAGAAGGAGAGT TCTTCCGATGCAGGTT CAGT CAAGGGAGACACCATTACAC CAGC TAT 
ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGA 

ATACC AACATC TGCAT ACCC AGC T CCAGTATGAGT GCATCTCAC C C T TC TAC C GCCGCCTGGGCAGCAGCAGGAGGACAT GT 
CTGAGGACT GGGAAGTGGAGT GGGCGGGCACC ATC C TGCATCCC TATC TGCGGGAAAATTGAGAACATCACTGCTCCAAAGA 
CCCAAGGGT TGCGCTGGCCGTGGC AGGCAGCCATC TACAGGAGGACCAGCGGGGTGC ATGACGGCAGCCTACACAAGGGAGC 
GTGGTTC C TAGT CTGCAGCGGTGCCC TGGTGAATGAGCGCAC TGTGGTGGT GGCTGCCCAC TGTGTTACTGAC C TGGGGAAG 
G TCACCATGAT C AAGACAGCAGAC C TGAAAGTTGTTTTGG GGAAAT T C T ACC GGGATGATGACCGGGATGAGAAGACCATCC 
AGAGCCTACAGATTTC T GC TATC AT TCTGCATCCCAACTAT GACCCC AT CC T GC T TGATGC TGACATCGCCATCC T GAAGC T 
CCTAGACAAGGCCCGT ATCAGCACCCGAGTCC AGC CCAT C TGCCTCGC T GC C AGTCGGGAT CTCAGCACTTC C TT CCAGGAG 
TCC CACATCAC T GTGGC TGGC TGGAATGTCCTGGCAGAC GTGAGGAGCCC T GGC T T C AAGAACGACACACTGCGC TC TGGGG 
TGGTCAGTGTGGTGGAC TCGC TGC T GTGTGAGGAGCAGC ATGAGGACC ATGGC ATC C CAGT GAGTGTCACTGATAACATGT T 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGCAT C T CCTGAGCC ACGC TGGC ATCT GATGGGACTGGTCAG CTGGAGC T ATGAT AAAACATGCAGCCACAGGCTCTCCA 
CTGCCTT CACCAAGGTGCTGCCT T T TAAAGACTGGAT TGAAAGAAATATGAA ATGRA CCAT GCTCATGCACT C CTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
CTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAGAAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCA 
C TCCACTGACC TGGT GGTC TTCCC CAACT TTCAGTTATACGAATGCC AT CAGC TTGACCAGGGAAGATC TGGGCT TCATGAG 
GCCCCTT T TGAGGCT C T CAAGT T C TAGAGAGCTGCCTGTGGGACAGC CC AGGGCAGC AGAGCTGGGATGTGGTGCATGCC T T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 5*8 



MELGCWTQLGLTFLQLLL I SSLPREYT VINE ACPGAEWNIMC RECCE YDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP IQSIGSS LHVLFHS DGSKNFDG FHAI YEE I TACS S S PC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALP FGDLPMGYQHLHTQLQYECI S P FYRRLGS SRRT CLRTGKWSGRAPSCI P I CGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAI LKLLDKARISTRVQPICLAASR 
|| DLS TS FQE SHI T VAGWNVXADVRS PG FKNDTLRSGWSWDS LLCEEQHEDHG I PVSVTDNMFCA 

111 SWEPTAPSDICTAETGGIAAVS FPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 

m 



ERNMK 

Important features of the protein: 
Signal peptide: 



si 

|^| 

III amino acids 1-23 

M EGF-like domain cysteine pattern signature. 

fll 

pf amino acids 2 60-272 

P|j N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 

N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTC CTACAT CCTCT CATCTGAGAATCAGAGAGCAT AAT CTT CT TAC GGGCCCGT GATTT AT T AAC GTGGCTTAAT C 
T GAAGGTT CTCAGTCAAATT CTT TGT GAT CTACTGATT GT GGGGGCAT GGCAAGGTT T GCTTAAAGGAGCTT GGCT GG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAG AATGA AGGCGCTTCTGTTGC 
TGGTCTTGCCTT GGCT CAGTCCT GCTAACTACATT GACAAT GT GGGC AAC CT GCACT T CCTGTATT CAGAACTCT GTA 
AAGGT GCCTCCCACT ACGGCCT GACCAAAGATAGGAAGAGGC GCTCACAAGATGGCT GTCCAGAC GGCT GT GCGAGCC 
T CACAGCCACGGCTCC CT CCC CAGAGGTTT CTGCAGCTGC CACCATCT CCTTAAT GACAGACGAGCCT GGC CTAGACA 
AC CCT GCCT ACGT GT C CTCGGCAGAGGACGGGCAGCCAGCAAT CAGC CCAGT GGACT CTG GCCGGAGCAACCGAACTA 
GGGCACGGCCCTTTGAGAGATCCAC T ATTAGAAGCAGATCATTT AAAAAAATAAATCGAGCTTT GAGT GTT CT TC GAA 
£ ^ , GGACAAAGAGC GGGAGT GCAGTT GCCAACCAT GCCGACCAGGGCAGGGAAAATTCT GAAAACACCACT GCC C CTGAAG 

f** T CTTT CCAAGGT TGTACCACCT GATT CCAGAT GGTGAAAT T ACCAGCATCAAGAT CAATCGAGTAGAT CCCAGTGAAA 

f*\ GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 
fi% T GATCGC CAGAGACGGCC GGCT ACTGCCAGGAGACAT CATT CTAAAGGTCAAC GGGAT GGACAT CAGCAAT GTCC CTC 



ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCT GCCAGGT GCT GT GGCTGACT GT GAT GCGT GAACAGAAGT TCC GCA 



H I GCAGGAACAAT GGACAGGCCCC GGAT GCCT ACAGACC CCGAGAT GACAGCTTTCAT GT GATT CT CAACAAAAGTAGCC 

f ! « CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 

||| GT GTGGCATATCGACAT GGT CAGCTT GAGGAGAAT GACCGT GT GTTAGCCAT CAAT GGACAT GAT CTT CGAT ATGGCA 

a GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 

f*| GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAAC^GCAATGGC 

I p CTCCCAAGCCCCTCCATCCTACAATTACTT GTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 

p% GCATGACC GTC GCAGGGGGAGCATCACATAGAGAAT GGGAT TT GCCTATCTATGT CAT CAGT GTT GAGCCC GGAGGAG 

f Jf T CATAAG CAGAGATGGAAGAAT AAAAACAGGTGACATTTT GTT GAATGTGGATGGGGT CGAACTGACAGAG GTCAGCC 

m GGAGT GAGGCAGT GGC ATTATT GAAAAGAACAT CAT CCTC GATAGTACTCAAAGCTTT GGAAGT CAAAGAGTATGAGC 

HI CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 

m 

GGGTCATGT GGCTG GAATTACCACGGTGCTTGTATAACTGTAAAGATATT GTAT T ACGAAGAAACACAGCT GGAAGT C 
T GG GCTTCTG CATT GT AGGAGGT TAT GAAGAATACAATGGAAACAAAC CT TTTT T CAT CAAATCCATT GTT GAAGGAA 
CACCAGCAT ACAATGATGGAAGAATTAGAT GTGGT GATATT CTT CT TGCT GTCAATGGTAGAAGT ACAT CAGGAAT GA 
T ACAT G CT T GCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATT ACT CTAACTAT TGTT TCTT GGC C T GGCACTT 
TTT TATAGAAT CAAT GATGGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACT ATATTTATC 
TTGTC^GTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTC^^ 

ACACCTCAGAAAATAT GATTC CAAAAAAATTAAAACTACTAGTTT TTT TT CAGT GTGGAGGATTT CTCAT TACT CT AC 
AACATTGTT TAT ATTTTTT CTATTCAATAAAAAGCCCTAAAACAACT AAAATGAT T GATTT GTATACCC CACT GAATT 
CAAGCTGAT TTAAAT TTAAAATTTGGTATATGCT GAAGT CT GCCAAGGGT ACATTAT GGC CATTTTTAATTT ACAG CT 
AAAATATTT TTT AAAATGCATTGCT GAGAAACGTT GCTTT CAT CAAACAAGAATAAAT ATT TTT CAGAAGT T AAA 
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FIGURE 4Q 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVMTHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHII IQHIYRDGVIARDGRLLPGDI ILKVNGMDI SNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNH3MAPP 
S DWS PSWVMWLELPRCLYNCKDIVLRRNTAGSLGFC I VGGYEEYNGNKP FFIKS IVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTT GCCT GCAGTGAAGCAGAGAGATAGATAT TATTCACGTAATAAAAAACATGGGCT TCAACCT GACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGAC CCTCATTT T GGGGAAGGGAAAAACTCTGACTAATGAAGCAT CCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTT TACAGAGGGT CGC CATCCTCGTTCCCCACC GGAACAGAGAGAAACACCTGATGTACCTGCT GGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACT GCTT TATATTCCACGAT GTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGT TACAGTGGATATT TTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
GAT GAAGCT CT TACAC CAAGTGT CACGAGTCTGGAGAACAGAT GGGTTGAGTAGTTGTTCTTAT AAAT 
TAGTATC TGTGGAACACAAT CCTT TATATATCAACAT CACAGTGGATTTCT GGTTTGGTGC ATGA CCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
CAT GAGGGTTAAATAT T GTAATAT GGATACTT GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 
AATGAAC GCTATTTGAGGAC TCTGGTTGAAGGAGATT TATTTAAATTT GAAGTAATATATTATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGG CG TC CACAAAAGAGGG GAGAAAAG G C GAC GAAT CAGGACACAGT GAAC T T GG GAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTT TAAAGAGT TT TTGTAAAATGATTTTGTACAAGTAGGATATGAAT TAGCAGTT TACAAGTT T ACAT 
ATT AACT AATAATAAATAT GTCTATCAAATACC TCT GTAGTAAAAT GT GAAAAAGCAAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHIoMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins, 

amino acids 191-202 
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FIGURE 43 



GCT CAAGACCCAGCAGTGGGACAGC CAGACAGACGGCACGATGGCACTGAGCTCCCAGAT C T GGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAACTT GCAGAGC TGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGC TGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 
T CC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 





ru 
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FIGURE AJ. 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CI FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 4S 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATG GCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GAC C TTCAACACAACCCCT CT T GTCACCATACAGCCAGAAGGGGGCACT AT CATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCT TCATCTGCGTT GCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTT TCTGAAGAGAGAGAGACAAGAAGAGTACAT TGAAGAGAAGAAGAGAG T GGACATT TGT CGGG 
AAACTCCTAACATATGCCC CCATTC T GGAGAGAACACAGAGTACGACACAATCCC TCACACTAAT 
AGAACAATCC TAAAGGAAGATCCAGGAAATAC GGTT TACT CCACTGT GGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCAGTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 



MAGSPTCLTL I Y I LWQLTGSAASGPVKELVGS VGGAVT FPLKSKVKQVDS I VWT FNTT PLVT I QP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VAE^NPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 



Important features: 
m Signal peptide: 

81 amino acids 1-22 

■p Transmembrane domain: 

|k amino acids 224-250 

Leucine zipper pattern. 

Ill amino acids 229-251 

m 

U\ N-glycosylation sites. 

f|| amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 

291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 

AACAGTAATGCCAAT T G TGAAT T T T CAT T GAAAAACAT CAGTGACAT TCAT CCAGAAT CCT T CAA 
M CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 

W ACACCAT GGCGAG TGGCTGGAGAGCATCTAGT T TCCACT T CGATTCT GAAGAAAACAAACAT AGG 

|g CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 

|JJ CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 

If! TG TAG TT TAATGGGAATAAAATGTAAGTATCAGTAGT TTGAAAAAAAAAAA 

i 

p 
m 

f'i 

ft 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVESDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLS SFFSVI TVIGALYCML I S I QALLKGPLMCNS PSNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites . 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAG T CACACAGT TGCTGACCAAT TGAGCT GTGAGCC T GGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAG T GACCC T GGCCCT T CT CC TAC T GGCAGGCCTGACTGCCT T GGAAGCCAATGAC C 
CAT T TGCCAATAAAGAC GATC CCT T CT ACTATGACT GGAAAAACCT GCAGCTGAGCGGAC T GAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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id 



I || 

III 

m 



FIGURE so 



MERVTLALLLLAGLTALEA^ 
SQKQHS PVPEKAI PLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 



amino acids 41-47, 45-51, 84-90 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 



AppJD=10063546 



Page 200 of 318 



FIGURE 51 



GT GGACTCT GAGAAGCCCAGGCAGT T GAGGACAGGAGAGAGAAGGCT GCAGACC CAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCAC T GGGACAAAT AT TGGGGAGGC CC T T GGACATGGCC T GGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
fa AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
P T TGGCAGACAGGCAGAAGATGTCAT T CGACACGGAGCAGATGCT GT CCGCGGCTCCTGGC AGGGG 

fef GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

lis 

L| CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
HI GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
M 5 AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
||| AAT G GCAGC AG C AGT GG T GGCAG CAG C AGT GG CAG C AGCAG T G GCAG C AGC AGT GGC GG C AGCAG 

Q T GGCGGCAGCAGTGGT GGCAGCAGT GGCAACAGTGGTGGC AGCAGAGGTGACAGCGGCAGTGAGT 

I'll CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
M CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
' C T T CAGAGGACAGGGAG T T TC CAGC AACATGAGGGAAATAAGC AAAGAGGGCAAT CGCC T CCT T G 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGT GGAGGAGGTGACGCT GT T 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE S2 

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTRELftVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
S SGS S SGS S SGGS SGGS SGGSSGNSGGSRGDSGSE S SWGS STGS S SGNHGGS GGGNGHKPGCEKPGNE 
ARGSGESGI QGFRGQGVSSNMRE ISKEGNRLLGGSGDNYRGQGS SWGSGGGDAVGGVNTW SETS PGM 
FNF DT FWKN FKS KLG F I NWDAI NKDQRS S RI P 



Signal peptide: 
amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242 f 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CCT CAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACAT CAGCCTCATGACCT TGGACAGTCTA 
CAGAAATGCAT CT T CAGCT T TGACAGCCAT TGTCAGGAGAGGCCCAGTGAATATAT TGCCACCAT 
CT T GGAGC TCAGTGCCCT TGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGAC TT TCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CAT TGTCAGAT GAGGATATAAGAGCAGAGGCTGACACCTT CATGT T T GGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGC T T CT GAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCT GGCC C 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCT CATCGATAT TATAGGGGTCCAT CACAACCCAAC TGT GTGGCCGGATCCTGAGGTC TACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 



FIGURE 

MSLLSLPWLGLRPVAMS PWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFMKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 
LAFIPFSAGPRNCIGQAFAMAEMKVVIALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 

ATC GCAT C AAT T G GGAG T ACCAT C T T C C T CATGGGACC AGT GAAACAGC T GAAGC GAAT GT T T GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCATGGCCAGT TT TATGAAGCT TTGGAAGGCACTAT GGACAGAAGCTGGTGGACAG T T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GT GATTCGAACAT T TGAGGGT TACT T T T GGAAGCAACAAT ACAT TCT CGAACCT GAATGT CAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 

MGPVKQLKRMFEPTRL I AT IMVLLC FALTLCS AFWWHNKGLAL I FC I LQS LALTWYS LS FI PFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE S7 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTT CAAGCTCAGT GTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGT GGAAGCAGAAAATTGTACAAGCT 
GGAGATAAGGACCTT GAT GGGCAGCTAGACTTT GAAGAAT TTGT CCAT TATCTCCAAGATCATGAGAAGAAGCTGAGG 
CT GGT GTTT AAGATT TT GGACAAAAAGAAT GAT GGACGCATTGACGC GCAGGAGATCATGCAGT CCCT GCGGGACTTG 
GGAGT CAAGATATCT GAACAGCAGGCAGAAAAAATTCTCAAGAGCAT GGAT AAAAAC GGCAC GAT GAC CAT C GACTGG 
AACGAGT GGAGAGACT ACCACCT CCTCCAC CC CGTGGAAAACAT CCC C GAGATCATCCT CT ACT GGAAGCATT C CACG 
AT CTTTGAT GT GGGT GAGAATCTAACGGT CCC GGATGAGT T CACAGT GGAGGAGAGGCAGACGGGGAT GTGGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGT CCAT G CCT C C CGCAGCAACAACAT GGGCAT CGTT GGTGGCTT CACTCAGATGAT TCGAGAAG GAGGGGCCAGG 
TCACT CTGGCGGGGCAATGGCATCAACGT CCTCAAAATTGCCCCCGAATCAGCCATCAAATT CAT GG C CTAT GAGCAG 
ATCAAGCGCCTTGTT GGTAGTGACCAGGAGACT CTGAGGATTCACGAGAGGCTTGTGGCAGGGT CCTTGGCAGGGGCC 
AT CGC CCAGAGCAGCAT CT ACC CAATGGAGGTCCTGAAGACC CGGAT GGC GCTGCGGAAGACAGGCCAGTACTCAGGA 
AT GCT GGACTGC GCCAGGAGGAT CCTGGCCAGAGAGGGGGTGGCCGCCTT CTACAAAGGCTATGT CC CCAACATGCTG 
GGCAT CAT CCCCTAT GCCGGCAT CGACCTT GCAGTCTACGAGACG CT CAAGAAT GCCT GGCT GCAGCACTAT GCAGTG 
P'| AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTC^WVCATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
||| GCTGT GAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCT GGGCGTGCAGT CGCGGTGACGGGGGGAGGGC 

CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGC^GCCA^ 

AAGCT GT CT CGAGCCAAG CTGT GAAAACCCTAGACGCACCCGCAGGGAGGGT GGGGAGAG CT GGCAGG CCCAG GGCT T 
GT CCT GCT GACCCCAGCAGACC CTCCTGTT GGT T CCAGCGAAGACCACAGGCATT CCTTAGGGT CCAG GGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
g CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
«i| ATAATCCATGATGAAAGGTGAGGTCACGT GGCCTCCCAGGCCTGACTT CCCAACCTACAGCATTGACGCCAACTT GGC 

■Is'l TGTGAAGGAAGAGGAAAGGATCTGGCCTT GTGGTCACTGGCAT CTGAGCCCTGCTGAT GGCTGGGGCT CTCGGGCATG 

||1 CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCT GGACCCT GT CAGGATGGGCCCCACCT CAGAAC CAAACT CACT GTC C CCACTGT GGCATGAGGGCAGT GGAGCA 
!*! CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
f|| ACTGT TGGGAAAAGGGTTTT GT C CAGAAGGACAAGCC GGACAAAT GAGCGACTTCTGT GCTT CCAGAGGAAGACGAGG 

^ GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
M GG GAC CAGCCCCACATTCCACTT GTGT CACTGCTTG GAAC CTATT TATTT T GTATTT ATTT GAACAGAGT TATGT CCT 

fU AACTATTT TTAT AGAT TTGT TT AATTAAT AGCTT GT CATTT TC AAGTT CAT TTTTTATTCATATTTAT GT T CATGGT T 

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAAClAATGCCGGCCAACTGGCGACCTCACGGTTGCtACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTT CAAT AGGATGCAAAGAT CAAT GCAAAAATTGT TATATAT GAAC AT ATAACT GGAGT C GT CAAAAAG 
CAAATTAAGAAAGAATT GGACGTTAGAAGTTGT CAT TTAAAGCAGCCT TCT AATAAAGTTGT TT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



f 

O 

q 
m 
W 

m 



App_ID=l 0063546 



Page 207 of 3 18 



FIGURE *8 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT IDWNEWRDYHLLHPVENI PE 1 1 LYWKHS T I FDVGENLTVPDEFTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVTJ4QVHASRSNNM^^ 

FMAYEQ I KRLVGS DQE T LRI HERLVAGS LAGAIAQS S I YPME VLKT RMALRKT GQY S GML DCARR 
I LAREGVAAFYKGYVPNMLG 1 1 P YAG I DLAVYET LKNAWLQHYAVNS ADPGVFVLLACG TMS S T C 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMS SLFKH I LRTE GAFGLYRGLAPNFMKVT PAVS I S YV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 



App_ID=10063546 



Page 208 



FIGURE 5Q 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CGTGATACAATGGCTGAAGGAAGGT GT T TTAGGC T T GGTCCATGAGT TCAAAGAAGGCAAAGAT G 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT CAC TTCTAAAGGCAAGGGGAATGCTAACC TTGAGTATAAAACTGGAGCCTTCAGCAT GCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
III CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

J! 



111 T CAACAACACAT ACTCC TGTATGAT TGAAAATGACATT GCCAAAGCAACAGGGGATATCAAAGT G 

ACAGAATCGGAGAT CAAAAGGCGGAGTCACCTACAGCT GCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
P GT GCCTTGGC CACAAAAAAGCAT GCAAAGTCAT TG T TACAACAGGGATCTACAGAACTATT TCAC 

pi CACCAGATAT GACC TAG T TT TAT AT TT CT GGGAGGAAAT GAATTCATAT CTAGAAGTC TGGAGT G 

|*1 AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTAT CT TCAAAGACATAT TAGAAG TT GGGAAAA.TAATT CATGTGAACTAGACAAGTGTGT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
T GT AATGT TGCT CT GAGGAAGCCCCTGGAAAG TCTATCCCAACATAT CCACATC T TATAT T CCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCTT CCCAAC TGACAAAT GCCAAAGT TGAGAAAAAT GATCATAAT TT TAGCATAAACAGAGCAGT 
C GGGGACACCGATT T T ATAAATAAACT GAGCACCT T CT T T T TAAACAAAAAAAMAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



ry 
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FIGURE 60 



MASLGQILFWSIISIII ILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
D I VI QWLKEGVLGLVHE FKE GKDEL SE QDEMFRGRT AVFADQVI VGNAS LRLKNVQLT DAG T YKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVT INOT 
CVS S FFAI SWALLPLSPYLMLK 



Important features: 
Signal peptide: 

ffi amino acids 1-28 

Ill 

III Transmembrane domain: 

iff 

amino acids 258-281 

m 

g N-glycosylation sites. 

|j amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 

111 220-224 

~4 N-myristoylation sites. 

m amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGARTCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
T GGAGGGCCCTATGGACACCCCAAT CCTGGGATGTT CCCCTCTGGAACT CCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AAT AGTGAGGACCGGGGCTGAGGCCAC ACAGATAGGGGCC T GATGGAGGAGAGGATAGAAGT TGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGGTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCAT GAGAGAGATCT GTCTATGGGACCAGTGGC TTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT T T GGGGCCAAAAG T CCAGTGAAAT T GTAAGCT TCAATAAAAGGAT GAAACTC T GA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYI SMKELKQALWCNWSSFNDETCLMM^ 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGN I RL S FE DFVTMTASRML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 

CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGC TGGGGAAATATT AATCGGTAT CCAGGAGGCAGC T GGGGGAATATT AATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGAT AGAGGGAAACCCAACAT TGGGAGT T AGAGTCCTGC TCCCGCCCCT TGCT G 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACTTAT 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 



FIGURE 64 

MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 



L\> Important features of the protein: 

13 Signal peptide: 

m 

amino acids 1-26 



HI Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 



If s 



N'T 



N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

.... I 

fi 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 



III 
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FIGURE 6* 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACAT^CAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYSSSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 



m 



N-glycosylation site. 

amino acids 37-41 



N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCC TCTCTGGGCAG T CAGAT CCACCCAG T GCT TAATAGCAGGGAAGAAGGTACT T CAA 
AGACTC TGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTT TTAT AT AT T TATATAAAATTAG 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 



MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTE GT YRPS SEEQ FSHAAEARAPQDSKE TVQGCL P I 

Important features: 
Signal peptide: 

amino acids 1-19 




N-glycosylation site. 



Transmembrane domain: 



amino acids 56-80 



amino acids 36-40 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 



amino acids 86-90 




Tyrosine kinase phosphorylation site. 

amino acids 86-94 



N-myri s toyl a ti on si te s . 



amino acids 7-13, 26-32 
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FIGURE 6q 



m 



GC CAGGAAT AACTAGAGAGGAACA ATGG GGTTATTCAGAGGTTTT GT TTT CCT CTTAGTT CT GT G CCT GCT G CACCAG 
TCAAATACT TCCTT C ATTAAGCT GAATAAT AAT GG CTTT GAAGATAT T GT CATT GTT ATAGATC CTAGT GT GC CAGAA 
GATGAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAGCTTCTACGTACCTGT TTGAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAAT GTAT CTATATTAAT T CCT GAGAATT GGAAGGAAAATCCTCAGTACAAAAGGCCAAAACAT GAA 
AACCATAAACAT GCT GAT GTTATAGTTGCACCACCTACACTCC CAGGTAGAGAT GAAC CATACAC CAAGCAGT TCACA 
GAATGTGGAGAGAAAGGCGAATAC^TTCACTTCACCCCTGACCTTCTACTTGGAAAAAAACAAAATGAATATGGACCA 
CCAGGCAAACTGTTT GTCCATGAGTGGGCT CAC CT CC GGT GGGGAGT GTT T GAT GAGTACAATGAAGAT CAGCCTTTC 
T ACCGTGCTAAGTCAAAAAAAAT CGAAGCAACAAGGT GTT C C GCAGGTAT CTCT GGTAGAAATAGAGT TTAT AAGT GT 
CAAGGAGGCAGCTGT CT TAGTAGAGCAT GCAGAATT GATT CTACAACAAAACTGT AT GGAAAAGATTGTCAAT TCTTT 
CCT GATAAAGTACAAACAGAAAAAGCAT CCATAAT GTT TATGCAAAGTAT T GATT CT GTT GTTGAATTTT GTAACGAA 
AAAACCCATAAT CAA.GAAGCTCCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGT GATTAGCAAT 
TCT GAGGATTTT AAAAACACCAT ACC CAT GGTGACAC CAC CTCCT CCACCT GTCTT CT CATT GCT GAAGATCAGT CAA 
AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA 
AAACATTT CCTGCTGCAGACTGTTGAAAAT GGAT CCT GGGTGGGGAT GGTTCACT TT GATAGTACTGCCACT ATT GTA 
AATAAGCTAATC CAAATAAAAAGCAGTGAT GAAAGAAACACACT CAT GGCAGGAT TAC CTACATATCCTCT GGGAGGA 
f j I ACTTCCAT CTGCT CT GGAATTAAATATGCATTT CAGGT GATTGGAGAGCTACATT CC CAACT CGATGGATC C GAAGTA 

f I CTGCT GCTGACTGAT GGGGAGGATAACACT GCAAGTT CTT GTATT GAT GAAGTGAAACAAAGTGGGGC CATT GTT CAT 

III TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 
Iff T CAGATGAAGCT CAGAACAATGGCCT CATT GAT GCTT TTGGGGCTCT T ACATCAGGAAATACTGATCT CTCC CAGAAG 

TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA 
GT GGGAAAGGAC ACGTTCTTTCT CAT CACATGGAACAGTCT GCCTCC CAGTATTTCT CTCT GGGATC C CAGT GGAACA 
||| ATAAT GGAAAAT T TCACAGT GGAT GCAACT TCCAAAATGGCCT ATCT CAGTATTCCAGGAACT GC AAAGGTGGGCACT 

TGGGCATACAAT CTT CAAGCCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTT CT CGAGCAGCAAAT TCT TCT 
$ GT GC CTCC AATC ACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTT C CCCAGCC CAAT GATTGT TTAC GCAGAA 

|J| ATTCT ACAAGGATAT GTACCTGTT CTTGGAGC CAATGTGACTGCTTT CATT GAAT CACAGAATGGACATACAGAAGTT 

TT GGAACT TTTGGATAAT GGTGCAGGCGCT GAT T CTTTCAAGAATGAT GGAGTCTACT CCAGGTATTT TACAGCATAT 
ACAGAAAAT GGCAGAT ATAGCT TAAAAGT T CGGGCT CATGGAGGAG CAAACACT GCCAGGCT AAAATT ACGGCCT CCA 
CT GAATAGAG CCGCGT ACATACCAGGCT GGGTAGTGAACGGGGAAAT T GAAGCAAACCCGC CAAGACCTGAAATT GAT 
GAGGATACT CAGACCACCTTGGAGGATTT CAGC CGAACAGCAT CCGGAGGT GCATTT GTGGT AT CACAAGT C CCAAGC 
ill CTTCC CT T GCCT GACCAAT ACC CACCAAGT CAAATCACAGACCTTGAT GC CACAGTT CATGAGGATAAGATTATT CTT 

ACATGGACAGCACCAGGAGATAATTTTGAT GTT GGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
JJg GATCT AAGAGACAGT TTT GATGAT GCTCT T CAAGTAAATACTACTGAT CT GTCACCAAAGGAGGC CAACT CCAAGGAA 

ill AGCTTTGCATTTAAAC(^GAAAATATCTCAGAAGAAAATGCAACCCACATATTTATTGCC^TTAAAAGTATAGATAAA 
AGCAATTT GACATCAAAAGTAT CCAACATT GCACAAGTAACTT T GTT TAT C CCTCAAGCAAATCCTGATGAC ATT GAT 
CCTACACCTACT CCTACTCCTACTCCTACT CCT GATAAAAGTCATAATTCT GGAGTTAATATTTCTACGCT GGTATTG 
TCT GT GATTGGGTCT GTTGTAAT TGTTAACTT T ATT T TAAGTACCAC CAT TTGAACCT TAACGAAGAAAAAAATCTTC 
AAGTAGAC CTAGAAGAGAGTTTT AAAAAA.CAAAACAATGT AAGT AAAGGAT ATTT CT GAAT CTTAAAATTCAT CC CAT 
GT GTGATCATAAACT CATAAAAATAATTTT AAGAT GT CGGAAAAGGATACTT T GATTAAATAAAAACACTCAT GGATA 
T GTAAAAACT GT CAAGATT AAAATTTAAT AGTTT CAT TTATTT GTTATTT T ATTT GTAAGAAAT AGT GATGAACAAAG 
ATCCTTTTTCATACT GATACCTGGTTGTATATTATTT GAT GCAACAGTTT T CT GAAAT GATATTT CAAATT GGAT CAA 
GAAAT TAAAATCATCTATCTGAGTAGT CAAAAT ACAAGTAAAGGAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVS I L I PENWKENPQYKRPKHENHKHADVI VAPPTLPGRDE P YTKQFTECGEKGE Y 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRI DS TTKL YGKDCQFFPDKVQTEKAS IMFMQS I DS WE FCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAPCHFLLQTVENGSWVGMVHFDS TAT IVNKL I QTKS S DERNTLMAGLPTYPLGGTS I CSGI K 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIYHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANSSVPPITVNAKMNKDWSFPSPMIWAEILQGYVPVLGAOTTAFIESQNGHTEVLELLDNGA 
GADSFKNDGWSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQYNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHMSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCTTAGGTGGAAAC CCTGGGAGTAGAGT ACT GACAGCAAAGACCGGGAAAGAC CAT ACGT CCCCGGGCAGGGGT GA 
CAACAGGT GTCATCTTTTT GAT CTCGTGT GT GGCTGC CTT CCTATTT CAAGGAAAGACGC CAAGGTAATT T TGAC CCA 
GAGGAGCAATGATGTAGCCACCT CCTAACCTTCCCTT CTT GAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCT GCT CT GTGTGGTGGT TAACTCCAAGAGGCAGAACT C GTT CTAGAAGGAAATGGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTG AATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACT GCC CAGGGCCAACAGCCC CACGGGGAAGGAGGG GTACCAGGC CGTCCTT CAG GAGT GGGAGGAGCAGCAC 
C GCAACTACGTGAGCAGCCTGAAGCGGCAGATC GCACAGCT CAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
AATGGGCAGTAC CAAGCCAGCGATG CTGCT GGC CTGGGTCT GGACAGGAGCCC CCCAGAGAAAACCCAGGCCGACCTC 
CT GGCCTT CCTGCACT CGCAGGT GGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG 
IJI CCTTT CGAT AGCTTTACT CTACAGAAGGTGTACCAGCTGGAGACTGGC CT T ACCC G CCACCCCGAGGAGAAGCCT GTG 

«*| AGGAAGGACAAGCGGGAT GAGTT GGT GGAAGCCATT GAAT CAGCCTT G GAGACCCT GAACAAT C CTGCAGAGAACAGC 

CCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTAT 
|| | GAGCT CACCTT CAAAGGGGACCACAAACAC GAATTCAAAC GGCT CAT CTT ATTT C GACCATT CAGCC CCAT CATGAAA 

I ;i | GT GAAAAATGAAAAGCTCAACAT GGCCAACACGCTTATCAATGTTAT C GT G CCT CTAGCAAAAAGGGT GGACAAGTTC 

*l* CGGCAGTT CAT GCAGAATTTCAGGGAGAT GTGCATT GAG CAGGAT GGGAGAGT CCATCTCACTGTTGTTTACT TT GGG 

AAAGAAGAAATAAAT GAAGTCAAAGGAATACTT GAAAACACTT CCAAAGCT GCCAACTTCAGGAACTTTAC CTTCATC 
fj% CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTT CT GT GAT GTGGACATCT ACTT CACAT CT GAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
GTATT TTAT CCAGTT CT TTTCAGTCAGTACAAT C CT GGCAT AATATACGGC CAC CAT GAT GCAGT CCCT CC CTTGGAA 
CAGCAGCT GGT CATAAAGAAGGAAACT GGAT TTT GGAGAGACTT TGGATTT GGGATGACGT GTCAGTAT CGGTCAGAC 
TT CAT CAATATAG GT GGGTT TGATCT GGACATCAAAGGCT GGGGCGGAGAGGAT GTGCACCTTT ATCGCAAGTAT CTC 
CACAG CAACCTCATAGT GGTACGGAC GCCT GTGCGAGGACT CTT CCAC CT CTGGCAT GAGAAGC GCTGCAT GGACGAG 
p'| CT GAC CCCCGAGCAGTACAAGAT GT GCAT GCAGT CCAAGGCCAT GAACGAGGCAT CC CAC GGCCAGCT GGGCAT GCTG 

GT GTT CAGGCACGAGATAGAGGCTCACCTT CGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGA 
}|:| GAAGGATT GTGGGAGACACT TTT TCTTTC CTTTT GCAATT ACT GAAAGTGGCTGCAACAGAGAAAAGACTT CCATAAA 

pi GGACGACAAAAGAATTGGACTGATGGGTC^GAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 
J** AATCAAAAT CTC CGCT TTGCCTGCAAAAGTAAC CCAGT TGCACCCTGT GAAGTGT CTGACAAAGGCAGAAT GCTT GTG 

f'll AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATTCAT GATTTAAGAGCAGT TTT GTAAAAAATTCATT AGCATGAAAGGCAAGCAT ATT T CT CCTCATAT GAAT GA 
GC CTATCAGCAGGGCT CTAGTTT CTAGGAATGCTAAAATAT CAGAAGGCAGGAGAGGAGAT AGGCTTATTAT GATACT 
AGTGAGTACATTAAGTAAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAATATCGTGT CATATTT T CC CCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGT CCTTTTAACT GTCTCCGTTTTTTT CTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
ATTTTTATATTTTTTAAGAAGATACTTTGAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATAT 
C CAAGGACATGCCAAATGCTGATTCT GTCAGGCACT GAAT GTCAGGCATT GAGACAT AGGGAAGGAAT GGT TTGT ACT 
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
ACT TT CT GCTTT ACAGAAAAGGAAACTCATTCAGACT GGT GATATCGT GAT GTAC CT AAAAGT CAGAAAC CACATT TT 
CT CCT CAGAAGTAG GGACCGCTTTCTTACCTGT TTAAATAAACCAAAGTAT ACC GT GT GAAC CAAACAAT CT CTTT TC 
AAAACAGGGTGCTCCT CCTGGCTTCT GG CT TCCATAAGAAGAAAT GGAGAAAAAT AT ATAT ATAT ATATAT ATAT T GT 
GAAAGAT CAAT C CAT CTGCCAGAATCTAGT GGGAT GGAAGT TTTT GCT ACATGTT AT CCACCCC AGGC CAGGTGGAAG 
T AACT GAATTATTTT T TAAATTAAGCAGTT CTACTCAATCACCAAGAT GCT TCT GAAAATT GCATT TT AT T ACCAT TT 
CAAACTATTTTTTAAAAATAAATACAGTTAACATAGAGTGGTTT CTT CAT T CAT GT GAAAATTAT TAGC CAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTT GCT CACAGTAAACT CAT T GTTTAAAAGCT T CAAGAAC 
ATTCAAGCT GTT GGT GTGTTAAAAAATGCAT TGT AT TGATT TGT ACT GGT AGT TT AT GAAATT T AATTAAAACACAGG 
C CAT GAAT GGAAGGT GGTATT GCACAGCT AATAAAATATGATTT GTGGATATGAA 
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FIGURE 72 

MMMVRRGLLAWISRVWLLVLLCCAI SVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRKYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNL I VWTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHE IEAHL 
RKQKQKT S S KKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACAC TGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCC TGAGTGT 
GGAAACAATGCC CAGT CGC CCAT CGATATTCAGACAGACAG T G T GACATT T GACCCTGAT TT GCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCC CAGC TCCACCT GCAC TGGGGTC AGAAAGGATCCCCAGGGGGGTC AGAACACCAGATCAACAG 
T GAAGCCACAT TTGCAGAGCTCCACATTGTACAT TATGACTCTGATT CCTATGACAGCT TGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGCT TATGAACACATTCT GAGT CACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT T TCAATGGAACAGCTGGAAAAGCT TCAGGGGACAT T GT TCT CCACAGAAGAGGAGCCCTC TAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGT GGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAAT TCC TTCTCAGATAC 
CAT GGAT GTGGATGACTT CCCT T CAT GCC TATCAGGAAGCC TC TAAAAT GGGGT GTAGGAT C TGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCTGTCAT TCCAGGAAGAACTGCAGAGC CT TC AGC CTC T CCAAACATG TAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 



AppJD=10063546 



Page 223 



FIGURE 74 



MLFSALLLEVIWILAADGGQHWTYSGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 




Important features of the protein: 
Signal peptide: 



amino acids 1-15 



Transmembrane domain: 



amino acids 291-310 




N-glycosylation site. 



Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 



amino acids 213-216 
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FIGURE 75 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
T CTGGTGGT T TGCC TAAACCTGCAAACAT CACCT TC T TAT CCATCAACATGAAGAATGTC CT ACA 
ATGGACT CCACCAGAGGGTCT T CAAGGAGTTAAAGT TACT TACAC T GTGCAGTAT TTCAT CACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTC CTG T T TCCAT GCAACAAATATAC TCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 



I , I T CT GTGATGGGC TAT TCCATC TACC GATATATCCAC GT TGGCAAAGAGAAACACCCAGCAAATT T 

|f! GAT T T TGAT T TATGGAAAT GAAT T T GACAAAAGAT T CT TT GT GCC TGCT GAAAAAATCGT GATTA 

4* ACT T TATCACCCTCAATATCTCGGAT GAT TCTAAAAT T TCTCATCAGGATATGAGTT TACTGGGA 

AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
p| GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
||| AAAACACGGAAGGTACT TCTC TCACCCAGCAAGAGT CCCT CAGCAGAACAATACCCCCGGATAAA 

P ACAGTCAT TGAATATGAATATGATGTCAGAACCACTGACATT TGTGCGGGGCCTGAAGAGCAGGA 

Hf GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAG 
fe» TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
C CAGACAGGCCACCAGGAGAAAAT GAAACC TATCT CAT GCAAT T CAT GGAGGAATGGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAAT G T T T GCCAGAC T GGGTGCAGAAT TTATT CAGGTG GGT GT 



AppJD-10063546 



Page 225 of 3 18 



FIGURE 76 



MS YNGLHQRVFKE LKLLTLCSISSQI GP PE VAL T T DE KS I SWL T AP EKW KRN P E DL P VSMQQ I Y 
SNLKYWSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAA GATGG CCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
T CGTGGAG TT CCACAT GACGACT GAGGCCCAAGC CACCAT CCGCAT GGACACCAGTGCAAGT GGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
fit TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
£ AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 

TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
p|j TGATCAAT GAAAAGGC TGCAGAT AAGC TGGGATC TACCCAGATCGTGAAGATCC TAAC TCAGGAC 

ACT CCCGAGTT T TT TATAGACCAAGGCCATGCCAAGGTGGCCCAACT GATCGTGCTGGAAGT GT T 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T TT ACACCAAAGG T GACCAAC T T AT AC TCAAC T TGAATAACATCAGC TCTGAT CGGATCCAGCT G 
ATGAACT C TGGGAT TGGCT GGT T CCAACC TGAT GTTCTGAAAAACAT CATCACT GAGATCAT CCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLWTVLKHIIWLKVITANILQLQVKPSANTDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMT T EAQAT I RMDT S AS G PTRLVLS DCAT SHGSLRI QLL YKLS FLVNALAKQVMNLLVPS LPNL 
VKNQLCPVI EAS FNGMYADLLQLVKVP ISLS IDRLE FDLLY PAIKGDT IQL YLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEEMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



CI 

U J 



GAGAGAAG TCAGCC TGGCAGAGAGACT C TGAAATGAGGGAT TAGAGGTGT T CAAGGAGGAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCT GGAT GGAATGT GC CACACACAGCACAGGCATCACC CAG T G TGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
H CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CAT GAAAT TTGAGAT T GGAGAGGCT CTT TACTT GGGCAT TAT T TCT T CCC T GT TC TCCCT GATAG 
111 CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
*p CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 

rri 

t?: "" GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
P| GGTCTGTGAAA?^ACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
||| GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
13 GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATTTGTGCATGAAC TGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGAT G 
CAGGAT GGGAGGACAGGAAGGCAGC CT GGGACAT T T AAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YSTLLGL PADI QAAQAMMVTS SAI S SLACI I S WGMRCTVFCQES RAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 




Signal peptide: 



amino acids 1-24 



Transmembrane domains: 



amino acids 82-102, 117-140, 163-182 




N-glycosylation site. 



PMP-22 / EMP / MP20 family proteins. 



amino acids 190-193 



amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
p ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
P GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

rn 

pjj TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
t-'JI AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
J* ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
p ACTGCCAGAC TCCT GCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 

GTGT GCAG TC GCTC CATGGGGTGAGACAT CCTCAGGATCCAT G T TCCAGT GAT GCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
C AGACCCAAGGGAGCAGGC AGCACAACT GTCAAGAT CGTCC TGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACC C T GGC CACAGT GAGAT CAGT T C T ACC AGG T G T CC C AAG G CAC CGG G CC GGG T C 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACT TGG T GGAGATC TACCTC TGGAAGC T GG TAAAAGAT GAGGAAACTGAGGC TCAGAGAG 
GTGAAGTACCTGGC CCAAGGC CACACAGCCAGAATCT TCCACT TGAC TCAGAT CAAGAAAGT CAG 
GAAGCAAGAC T T CCAGAAAGAGGCACAGCACT T CCGACTGC T CGC TGGCCCCCACGAAGGTCAC T 
GGAACGTCTTCCTAGCCCAGACCCT GGAGC TGAAGGTCACGGCCAGT CCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAACAGT TGCAGATATGAGCT GTATAAT TGT TGTT AT TAT AT AT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 8s 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCXCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TC CGT T T C CATGCC GT GAGGTCCATT CAC AGAACACAT CCATGGCTCT CATGCTC AGTTT GGTTCTGAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGC ATTCTCCT GT TTCCT GTCTCCT AAGACCAAT GC AGAGGCC AT GGAAGT GCGGTT CT TC AGGGG 
CCAGTT CT CT AGCGTGGT CC ACCT CT ACAGGG ACGGGAAGGACCAGCC AT TT ATGC AGAT GCC ACAGT AT C 
AAGGCAGG AC AAAACT GGTG AAGGAT T CT ATT GCGGAGGGGC GC ATCT CT CT GAGGCTGGAAAAC ATT ACT 
I** GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
W ACAGGT GT CAGC ACT GGGCTCAGT TC CTCTC ATTTC CATC AC GGGAT ATGT TGATAGAGACATCC AGCT AC 

TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AGACT CC AGG AC AAAC AG AGACAT GCATGGCCT GTT TGATGT GGAGAT CTCTCTGACCGTCC AAGAGAA 
CGC C GGGAGC AT AT CCTGT T CCAT GC GGCATGCTCATCTGAGCCGAGAGGTGGAAT CC AGGGT AC AGATAG 



01 



111 

r« GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
fh TTT T TT GGCATTGT T G GACTGAAGAT T T T CTTCTCC AAAT TC CAGTG GAAAAT CC AGGCGGAACTGGACT G 

S GAGAAGAAAGCACGGACAGGC AGAAT T GAG AGACGCCC GGAAAC ACGC AGTGGAGGT GACTCTGGATCCAG 

|J| AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
III CCTC AC TCTG AGAAGAGATTT ACAAGGAAG AGTGTGGT GG CTTCTC AGAGTTTCCAAGCAGGG AAAC AT T A 

El CT GGGAGGTGGACGGAGGAC AC AATAAAAG GTGGCGCGT GGGAGTGT GCC GGGATG ATGT GGAC AGGAGGA 

AGGAGT ACGTGACTTT GT CTCCCGAT CATGGGTACT GGGT CCTCAGAC TGAATGGAGAACAT T TGT ATTT C 
AC AT TAAATCCCCGTT TT ATC AG C GT CT T C CCCAGGACCCC ACCT ACAAAAAT AGG GGTC TT CCTGGACT A 
TGAGTGT GGG ACCATCTC CTTCT T CAAC AT AAATGACC AGTCCCTT AT TT AT ACCCT GAC AT GTCGGT TT G 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CCAGTC ACCC AGGAAT C AGAGAAAGAGGCC TCTT GGCAAAGG GCCTCT GC AATCCC AG AGAC AAGC AACAG 
TGAGTC CTCCTC AC AGGC AACC AC GC CCT TCCTCCCCAGGGGTG AAAT GTAG GATG AAT C AC ATCCCACAT 
T CTT CT TT AGGG AT AT T AAGGTCT CT CT CCCAGATCCAAAGT CCCGCAGC AGCCGGCC AAGGTGGCTTCCA 
GATGAAGGGGGACT GGCC TGTCCACAT GGG AG TCAGGT GT C ATGGCT GCC CT GAGCT GGGAGGGAAGAAGG 
CT GACAT T AC AT TTAGTT TGCTCT CACT CC AT CTGGCT AAGT GATCTT GAAAT ACC ACCT CT CAGGTG AAG 
AACC GT C AGG AAT T CCC ATCTC ACAGGC TGTGGTGT AGAT T AAGTAGACAAGGAATGTGAAT AATGCT TAG 
ATCT TAT TGATG ACAGAGTGTATCCT AATGGT TTGT TC ATT AT ATT AC ACT T TC AGT AAAAAAA 



rw 



App_ID=10063546 



Page 233 of 3 18 



FIGURE 84 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFKQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGS VPL I S I TGYVDRDIQLLCQS SGWFPRPTAECWKGPQGQDLS T DSRTNRDMHGL FDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
Jwfc LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
M PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Q 
IP 

r ii 

5 S. S 



fit 



rs 5 



Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 



AA.CAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACT TCT TTCGTATGGAGAAAGGAAG TATAAAATGGAAT TATAAACATCACCGGCTCTCT GT GAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
C TGAGGGATGCAGC TGAAT TCACCT GCAGAGC TCAGAACCC TC TCGGC T CT CAGCAGG TCTACC T 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGC TGAT TC T TGTAGAAT TAACAGCCCT CAACGT GATGAGC TATGATAACAC TATGAAT TAT G 
T GCAGAGTGAAAAGCACACAGGCTT TAGAGTCAAAGTAT CTCAAACCTGAAT CCACAC TGT GCC C 
T CC CT TT TATT T T T T T AACTAAAAGACAGACAAAT TCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKN^ 

KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFWVRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGC T AATAC T TAG T T CAAGGAATGGACCT GT 
TCT TCGTCTCCATC TCTGCCCAGAAGCT GCAAGGAAAT CAAAGACGAATGT CC TAGTGCAT T TGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACAC CT T T GGATCT GCAGAGGCGGCCAC GAGCGATGACTACAAGAACCCTGGCT ACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCC CT GCTGAGGTACCGCACGGACAC T GGC T TCC TCCAGACACTGGGACAT AATCT 
GTT TGGCATC TACCAGAAATAT CCAG TGAAATATGGAGAAGGAAAGT GT TGGAC T GACAACGGCC 
C GGTGAT CCC T GTGGT C TATGATT T T GGCGACGCCCAGAAAACAGCAT CTT AT TACTCACCCTAT 
GGC CAGCGGGAATT CAC TGCGGGATT TGT TCAGTT CAGGGTAT T TAATAAC GAGAGAGCAGC CAA 
CGCCTTGTGTGCTG GAAT GAGGGT CACC GGAT GTAACAC T GAGCATCAC T GCAT T GGT GGAGGAG 

13 

,p GAT ACT T TCCAGAGGCCAG TCCCCAGCAGT GTGGAGATT T T TC T GGT TT TGAT TGGAGTGGATAT 

fn GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 

It? 

f|| T TGAGAGT T T TGTGGGAGGGAACCCAGACCT CTCC T CCCAACCATGAGATC CCAAGGATGGAGAA 

13 CAAC T TACCCAGTAGC TAGAAT GT TAAT GGCAGAAGAGAAAACAATAAATCATAT TGACT CAAGA 

AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVT PWYDFGDAQKTAS YYS PYGQRE FTAG FVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 

is 

I TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTC TGA TTACCTTCA 
I T GACGGGAACCT AAGGACGAAGCCT AC AGGGGCAAGGGCC GC TTCGTAT T C CT GGAAGAAGGAAG 

* GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
I TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
I T TAAGACT TATATACAGTTTTAGGGGACAATTAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSL I S I GAHCSASVALS FFI FERWECTT YWYI FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 



AppJD=10063546 



Page 240 



FIGURE Qi 



CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
Jwfc GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
M C CAAGACAAT G C CACAG GC CAGGGAAACAT C T CC AT CAGC C TC G T GC C CCC CAG T AAAG CT G TAG 

n 

£E AGT T CCACCAGGAACAGCAGAT C TT CATCGAAGC CAAGGC C TCCAAAATC T T CAAC TGCCGGAT G 

ill 

I . | GAGTGGGAGAAGGT AGAACGGGGCCGC CGGACCT CGCTT T GCACCCACGACCCAGCCAAGAT C T G 

||| CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

?p T CTACATCGCCT TCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 

CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

!L TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 

|j| AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

Q AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 

ill AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 



HI 



TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCT TAAGC TAAGACAGGACGAT TGTGG T CC TCCCACACTAAGGCCACAGCCCAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCC FVFLVQGS L YLVI CGQDDG PPGSEDPERDDHEGQPRPRVPRKRGH I S PKS RPMANS T L 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSPCAVEFHQEQQIFIEAKASKIMCRMEWEKVERGRRTSLCTHDP 
AKI CS RDHAQS SATWS CSQP FKWCVY I AFYS TDYRLVQKVCPD YN YHS DT PY YPS G 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCCMGA.CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGrCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATAT CTGCTGATC T TTGGAGCGT TTGTCTCTGTCTATATCCAAGAA 
ATGT TCCGAT T T GC AT AT TATAAACTC T T AAAAAAAGC CAGT GAAGGT TTGAAGAG TATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 



III AACCTCAGGGAACCAGCAC TTCC CAAACCGCAGAC T ACAT C T T TAGAGGAAGCACAAC TGT GCCT 



T T T T CTGAAAATCC CTTT T TC TGGT GGAAT TGAGAAAGAAATAAAAC TATGCAGATA 
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FIGURE QA 



MTAAVFFGCAFI AFG PALALY VFT I AI EPLRI I FL I AGAFFWL VSLL I SS LVWFMARVT I DNKDG 
PTQKYLLIFGAFVSVYIQSMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQT F I S S YYG INLASAF 1 1 LVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains : 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 



AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGG TAGCAATC TGAAAC ATGA GGAGTACGATT CT AC TGT TTTGTCTTC T AGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAG T CAAATCAGGTC TT TC C T TC TT TAAGTCTGATACCAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAAT TT T TGTCACACAAC T TGGAGCCCAGGGCACTAT CC TAAGCTCAGAGGAAT TGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 



0 TAAT CCAGAT GT CCAGGATGGAAGCCTTC CAGCAGGAGGAGCAGGTGTAAATCCTGCCAC CCAGG 

M! GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 

J"!!; GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATC^^ 

IN 

p p GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 

||| AT T GAT TAT AT T ATGG AAT AG AT T GAGAC AC AT T G GAT AG T C T T AGA&G AAAT TAAT T C T TAAT T 

P TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 

ft 

pw" CAAT AAT TCAAT GGATAAATCT GT CT TT GAAATATAACAT TAT GCTGCCTGGAT GATATGCATAT 

in 

pi T AAAACATAT T T GGAAAAC T GGAAftAAAAAAAZ^\AAAAAAAAAAAAAAAAAAAAZU\AAAAAAAAA 

HI AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

ft? 
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FIGURE Q6 

MRSTILLFCLLGSTRSIiPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAQ^TPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
L PQ I FT S L I IHS LFPGG I L PT S QAGANPDVQDGS LPAGGAGVNPATQG TPAGRL PTPS G 
TDDD FAVT TPAG I QRS T HAI EE ATTE SANG I Q 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



rii 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGC TGTCTGTGGAAGT TCCAGAAAACTATGG TGGAAAT T TCCCT TTATAC CT GACCAAG T TGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCC T T GACCACGCC CT GGAGAGGAC C TACCAGCTGT T GGTACAGGTCAAGGACATGGGTGA 



W CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
|!l CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
ft T GCAGAGGGAAACC TC TACGT GACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAG TAC C TGC TCC 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
*P ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
Er» CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
W T CGAAGTCGCAGTCACAGAT AT CAAT GATCACGCCCC TGAGT T CATCACT TCCCAGAT TGGGCCT 

III ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
flj CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
pi'. GCCT GGAT TGGGAGCCAGACTC TGGGCAT GT TAGAC T CAGAC T C T GCAAGAACC TCAGT TATGAG 

GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT TT T CACC CAC T GGACCAT G T CAAGGAAGAAGGACC CGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCC TGGGAGAGAGCCCAGCACC CAAGATCCAGCAGGGGACAGGACAGAGTAGAAGC CCC TCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACT GCCCATGGGAGT GC TCCAAAT GTCAGGG T GT T T GCCCAATAATAAAGCCC CAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T S LDHALERT YQLLVQVKDMGDQAS GHQATAT VEVS 1 1 E S TWVS LE P I HLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDIKDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQES YEASVPI SAPAGS FLLT 
I QPSDPI SRTLRFSLVNDSEGWLC I EKFSGE VHTAQS LQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDIASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH 1 1 PVWSHNAQMWQL LVRVI VCRCNVEGQCMRKVGRMKGMP TKLS AVG I LVGTLVAI GI FL I 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAA.CCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAGCCTTTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCT TGC T TTC CAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAATG 
TTCT CCT TATGT TT GGTCTACTAT TGCATT TAGAAGCTGCAACAAAT T CCAAT GAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAAC TCTGAGT T CCATACAACC T CCAGTGGGAT CAGCACAGCCACCAACT CTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAACTCT GACTCCAGCACAACC TCCAGTGGGGC TAGCACAGCCACCAACTCTGAG T CCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGC CACCAAC TC T GACTC CAGCACAGTGTCCAGTGGGGCCAGCAC TGC CACCAACT CTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
C TGAGTC CAGCACAGTGTCCAGTGGGATCAGCACAG TCACCAAT TCTGAG TCCAGCACACCC TCC 
AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
M GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG 
| : || CAC AGTGTCCAGTGGGATCAGCACAGTCACCAAT TCT GAGT CCAGCACAACC TC CAG TGGGGCCA 

fl ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
PJl ATGCACACAAC T TCCCATAGTGCATCTAC T GCAGTGAGT GAGGCAAAGCC TGGTGGG T CCC TGGT 

H * GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 

TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGC GGGCCC TGA GCAGCCCC GGAAGCAAGTGCCGCAT TC TT CAGGAAGGAAGAGACCT GGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C T TGAAGAAGG TAT TCCTCACCT T TCTTGCCTT TAC CAGACACTGGAAAGAGAATACTATAT 
T GCTCAT T T AGC TAAGAAAT AAATACAT C TCATCTAACACACACGACAAAGAGAAGC TGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT C T GGCAT TCAAAATC TCCACAGTAAAATCCAAAGAC C T CAAA&AAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



Li 



m 

VI? - 
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FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVS IVTNSEFHTTSSGISTATNSSFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
j, ffc TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
f j STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
P AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

P AMEMSGRNSGP 

III 

yi 

F £J Signal peptide: 

fit amino acids 1-20 

is 

Transmembrane domain: 

g*t amino acids 510-532 

M 

$ ?;f 

ril 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGAT G1 GATAT ATGC AT T CCAGGGGAAGGGAAAT TGTGGT GCT TC TGAACC CAT GGTCAAT T 
AAC GAGGCAGT TTCTAGC TAC T GCACGTACTT CATAAAGCAGGACTCTAAAAGC T TT GGAATCAT 
GGT GT CATGGAAAGGGAT T T ACT T TATAC TGAC TCTGTT T T GGGGAAGCT x T TT T GGAAGCATT T 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
g*| CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAACTGGGGATGCAT T T G TTCCTGGAGAAAGAAGTGTCAT TAT CATGAACCAT CGGACAA 
GAATGGACT GGATG T TCC TGT GGAAT TGC C TGATGCGATATAGCTACC TCAGAT TGGAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 



\- ■-■ ■■■■ 

Hi 

111 



m 

Q 

m 



C T T CATT CATAGGAAATGGAAGGAT GACAAGAGCCAT T T CGAAGACAT GAT TGAT T ACT TTTGTG 
f|| ATAT T CACGAAC CACT TCAACT CCTCATAT TCCCAGAAGGGAC TGATCTCACAGAAAACAGCAAG 

?■ TCTCGAAGTAATGCAT TTGC TGAAAAAAAT GGAC T T CAGAAATATGAATAT G TT T TACATCCAAG 

AACTACAGGCT T TAC T TT TGTGGT AGACCGTC TAAGAGAAGGTAAGAACC T TGATGCT GTCCATG 
ATATCACT GTGGCGTATC CT CACAACATTCCT CAATCAGAGAAGCACC TCC TCCAAGGAGACT T T 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT T T TTATT T T ACCGGACAGAG TGTCAT TCCACC T TGCAAGT C TGAACT CAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATT TGTACAG T C T TGT TAAGTGG TAT T T TATAAT CACCATT G TAATCT T TGT GCTGCAAGAGA 
GAATATT TGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTT T TACACAAACAGCCACAT 
TTAAATTC^^GAAAAATGAGTAAGATTATAAGGTTTGCC^TGTGAAAACCTAGAGCATATTTTG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCAT CAT TAT T TGT TAAAGAT AT T T TGCAC T TAAT T T T GT GGGAAAAATAT TGC T ACAAT T T T T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT TAT TAAACAATCAT CAGGCTTTTAAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFWPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYSYLRLEKICLKASLK^^ 

LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLWKLLSILYWTLFSP^MCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
k LEIIELACYRLLHKQPHLNSKKKE 

! 

I Important features of the protein: 

5 Signal peptide: 

£ amino acids 1-22 



I Transmembrane domains : 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



I: A 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAAT CATCCATCCAC CCC T GCT GTCAT CTGTT T T CATAGT G TGAGAT CAACCCACAGGAATA 
TCC ATGG CTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CAC CT C TAC AGAGATGGGGAAGACTGGGAATCTAAGC AGAT GCCACAGTAT CGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 



| 5 s ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
|p GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
fl% TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
f I AAATC CAAAGGGAAAATCCAGGCGGAACTGGAC TGGAGAAGAAAGCACGGACAGGC AGAATTGAG 

r!f AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
m T T T CT GAT CTGAAAACT G TAACCCAT AGAAAAGCTCC CCAGGAGGT GCCT CAC T C T GAGAAGAGA 

JS TT T ACAAGGAAGAG T GT GGTGGCT TCTCAGGGTTT CCAAGCAGGGAGACAT TACT GGGAGG TGGA 

gf| CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACT TTGTCTCCCAACAATGGGTATTGGGTCCT CAGACTGACAACAGAACATTT GTAT TTC 
5^ ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
Itf GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
1!! TGACAT GTCAGT T TGAAGGCTT GT TGAGACCCTATAT CCAGCATGCGATG TAT GACGAGGAAAAG 

f| GGGACTC CCATAT TCATATGTC CAG T GT CC TGGGGATGAGACAGAGAAGACCC T GC T T AAAGGGC 

»l CCCACACCACAGACCCAGACACAGCCAAGGGAGAGT GCT CCCGACAGGTGGCCCCAGC TTCC TC T 

|}f CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
M TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
IP TCAGAAGCCATGGC TGCC CTGAAGTGGGGACGGAATAGACTCACAT TAGGT T TAG T TT GT GAAAA 

CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT T CC TGC CTCACAGGTGAAGAT TAAAGAGACAACGAATG TGAATCAT GCT T GCAGGT T 
TGAGGGC AC AGTGT T TGC TAAT GATGTGT T TT TATAT TATACAT TT TCCCACCATAAAC T C TGT T 
TGCT TAT T CCACAT TAAT T TAC T T T T CTC TATACCAAAT CACC CATGGAATAG T TAT T GAACACC 
T GCT TTGT GAGGCTCAAAGAAT AAAGAGGAGGTAGGATT T TTCACT GAT TCTAT AAGCCCAGCAT 
TACCTGATACCAAAACCA.GGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT T AAC ACAGACACAAAAAT T C T AAAT AAAAT T T T AAC AAAT T AAAC T AAACAAT AT AT T T A 
AAGAT GATATATAACTAC TCAGT GTGG T T T GTCCCAC AAAT GCAGAGTT GGTT TAATAT TTAAAT 
ATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 104 

MAFVL I LVLS F YELVS GQWQVT GPGKFVQALVGEDAVFS C S L FPET S AEAMEVRF FRNQ FHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
I VQENAGS I LCS IHLAEQSHEVESKVLIGETFFQPS PWRLAS I LLGLLCGALCGWMGMI IVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDYGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 



AppJD-10063546 



Page 254 
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b 

P 
ill 



it 



CCTT C AC AGGACTC TTCATT GCTGGTTGGC AATGA TGT ATCGGC C AGATGT GGT GAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT C ACTGT T CAT T ATGTG AGAT ATAATCAAAAGAAG ACCT AC AATTACT ATAGC AC AT TGTC AT T T AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTC AACAGAAGCATGGAGT GT T GGCTCAT AT GCTGTT GAT T TGT AGATTTCACTCT ACTGAGGATCCTG A 
AACT GT AG AT AAAATTGTTC AACT TGTTTT AC ATGAAAAGCT GC AAGATGCT GT AGGACCCC CT AAAGT AG 
AT CC T C ACTCAGT TAAAATT AAAAAAATCAACAAGACAGAAACAGACAGCTATCT AAACCAT TGCT GCGG A 
AC ACGAAG AAGT AAAACT CT AGGT CAGAGT CT CAGGATCGT T GGTGGG AC AGAAGT AG AAGAGGGT GAATG 
GCCCT GGC AGGCT AGCCT GCAGTGGGATGGGAGTCATCGCT GTGGAGCAACCTT AATT AATGCC AC AT GGC 
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
AT AAAACCTT CGAAAATGAAACGGGGTCTCCGGAGAAT AATT GT CC AT GAAAAAT AC AAACACC CATC ACA 
TGAC T ATGAT AT TT CTCT TGCAGAGCTTTCTAGCCCTGT TC CCT AC AC AAAT GC AGT ACATAGAGTT T GTC 
TCCCT G ATGC AT CCT ATG AGTT TC AACC AGGT GAT GTG ATGT TT GTGAC AGGATT TGGAGCACT GAAAAAT 
GATGGT T ACAGT CAAAATCATCTT CG ACAAGCACAGGT GACT CT CATAGACGCT AC AACTTGCAATGAACC 



T C AAGC TTAC AATGACGCCATAACTCCT AG AATGTT AT GTGCTGGCTCCTTAGAAGGAAAAACAGAT GCAT 
p| GCCAGGGTGACTCT GGAGGACCACTGGTTAGT TCAGATGCTAGAGATATCTGGT ACCTTGCT GGAAT AGTG 

|*J AGCT GG GGAGAT GAATGTGCGAAACCC AAC AAGCCTGGTGTT TAT ACT AGAGTT AC GGCCTT GCGGGACTG 

GAT T ACTTCAAAAACTGGTATC TAAGAG ACAAAAGCCT CATGGAAC AGAT AACATT TTTT TT TGTTTT TTG 



|*| GGT GTGGAGGCC AT T TTT AGAGAT AC AG AATT GGAG AAGACT TGCAAAAC AGCT AGATTTGACT GAT C TCA 

fW ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACTCTGT CAT CTGTGAGCAAT AGTTGAAACT TT ATGT AC AT AGAG AAAT AG AT AAT ACAAT AT TAC 
AT T ACAGCCT GT AT T C ATTT GT T CTCTAGAAGTTT T GTCAGAAT TT TG ACT TGT T G AC AT AAAT T TGT AAT 
GCAT AT AT AC AATT T G AAGC ACT C CTTTTCTT CAGT TCCTCAGC TCCTCT CATT TC AGCAAAT ATCC ATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT ATT AGGTGTT T TTCTT AGTGGAAT ATTAGAAATGATC ATATTCATT AT GAAAGGTCAAGC AAAG AC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT ATCCTTAT TT TCATTT CC AAAC AACT AC TATG AT AAAT GT GAAGAAGATTCTGTTTTT TTGTGACCT 
AT AAT AAT TAT ACAAACT TC AT GCAATGT ACT TGTTCT AAGCAAATT AAAGC AAAT AT TTAT TT AAC ATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 



App_ID=10063546 Page 255 of 318 



FIGURE 106 



MMYRPDWRARKRVCWE PWI GLVI FI SL I VLAVCI GLTVHYVRYNQKKTYNYYS TLS FT TDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVI^ 

PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRI IVHEKYKHPSHDYDI SLAELS S PVPYTNAVHRVCLPDAS YEFQPGDVMFVT G FGALKNDGYS 
QNHLRQAQVTLI DATTCNEPQAYNDAI TPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT GGTCACT TTGAAAGCCAGGATGACAAAT TATGGAT T AC CTAGATATCGGTGGCT TACTCA 
TGC T T GGAAT T T TTT TCAGAGAGAG TT TAAGTGCTGT GGAGTAG TATAT T TCAC TGAC TGGT TGG 
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
CAGGCCCACCAGGAAGAT CTCAGT GACC T T TAT CAAGAGGGT T GTGGGAAGAA&ATGT AT TCCT T 
TTT GAGAGGAACCAAACAACT GCAGGTGC TGAGGTT T CT GGGAATC TCCAT TGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGAT GTCC T T GAAGAATGACAACTC TCAGCACC TGTCATG TCCCTCAGTAGAACT 
GT T GAAACCAAGCC TGTCAAGAAT CT TT GAACACACATCCATGGCAAACAGCT T TAAT ACACACT 
T TGAGATGGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAAC T TGTT TTATTGGACT 
TGTGAAT T T T TGAG TACATACT AT GT GT T TCAGAAATATG TAGAAATAAAAATGT T GCCATAAAA 
TAACACC TAAGCATAT AC TAT T CTAT GCT T TAAAATGAGGATGGAAAAGT T TCATGTCAT AAG T C 
ACCAC CT GGACAATAAT T GAT GCCC T TAAAATGC TGAAGACAGATGTCATACCCACTGTGTAGCC 
TGT GTAT GACTTTTACTGAACACAGTTATGTTT TGAGGCAGCATGGT TTGAT TAGCATT TCCGCA 
TCCAT GCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGTTGAT TTACTTCTACCAA 
CTAG T ATATAAAGTAC TAATTAAATGCTAACATAGGAAGTT AGAAAATAC TAATAACT T TTAT TA 
CTCAGCGATCTAT TCT TC TGATGCTAAATAAAT TATATATCAGAAAAC TT TCAATAT T GGT GACT 
ACC TAAATGTGAT T T T TGC TGGT TAC TAAAATAT TCT TACCAC T TAAAAGAGCAAGCT AACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T TCAGT TCTGATAAT G TTAAGAATAACCAT TAT GAAAAGGAAAATT T GTC CT GTATAGCATCAT T 
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGTTATTTAAATACTTAACCACTAATTTTGAAAATTACCAGTGTGATACATAGGAATCATTATTC 
AGAAT GTAGTCTGG TC TT TAGGAAGT AT TAATAAGAAAAT T T G C AC AT AAC T TAG T TGATTCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TATC TT TGCCT T C TC CAAACAAGAAGCAATAGTC TCCAAGTCAATATAAAT T CTACAGAAAA 
TAGTGTT CT TT T TC T C CAGAAAAATGC T TGTGAGAAT CAT T AAAACAT GTGACAAT T TAGAGAT T 
CTTTGTT TTAT TTCAC TGATTAATATACTGTGGCAAATTACACAGATTATTAAAT TTTT TTACAA 
GAGTATAGT ATATTTATT TGAAAT GGGAAAAGT GCAT T T TACT G TAT TT TGTGTAT T T TGT T TAT 
T TC TCAGAATATGGAAAGAAAAT T AAAATGTGTCAAT AAAT AT T TTC TAGAGAGTAA 
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MAREDSVKCLRCLLYALNLLFWmSISVLAVSAP^RDYLNNVLTLTAETRVEEAVILTYFPVVHP 
VMI AVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQRE FKCCGVVYFTDWLEMTEMDWPPDSCCVRE FPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 




Signal peptide: 



amino acids 1-33 



Transmembrane domains: 



amino acids 12-35, 57-86, 94-114, 226-248 



m 
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FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CT T TCTCGT GAGAGCCTAGAGGCCTT AAAAAAAAAAGT GCT T GAAAGAGAAG GGGAC AAA GGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAAC ACT GACCACCAGTTGGAT GAGTCTCAAGGTCCTCCTCTATGTGAC AACCATGT GAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
C AGCT GCT GAG AT AC C AGC GAGT GC AC ATGC GCT C C AGGAACT GT GCT AGGC CCT GACAGGC AGACAT GCT TT G 
ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT 
GAGT GT GGGGT T GGC C GT GT GCT AAGAAGT G AT GGCAAGACT T GTGAAGACGTT GAAGGAT GCC ACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
CT GGT T GGT GGCCT GGAGCT CT T CCTGACC AAC AC CTCC TGCC GAGGAGT GT CCAAC GGC ACCC AT GT C AAC AT 
CCT C T T CT CT C TC AAGAC AT GT G GTAC AGT G GT CGAT GT GGT GAAT GACAAGAT T GT GGC CAGC AACC T CGT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACC T GC GAGTT T C CAC GCC T GT AC ACC ATT T CT GAAGGAT AC GT TCC CAAC C TT CGAAAC T CC C C ACT GGAAAT 
CAT GAGCC G AAAT CAT GGGATCT TCCC AT TC AC T C T GGAGAT C T TC AAGGAC AAT GAGTT T GAAGAGC CT T ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TT GGAAAGCTT G GT GGAGAGCT GCT TT GCC ACC C C C ACC TCC AAGAT CGAC GAGGT CCTGAAATACT ACCT C AT 
CCGGGAT GGCT GT GT TTC AGAT GACTC GGT AAAGCAGTACACAT CCC GGGAT CACC T AGC AAAGCACT TCC AGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGC GGCCCGATCCGCATCGACT GGGAGGACTAGTTCGT AGCCATAC CTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
C AGGTCACAGCACTGCT GAACAATGTGGCCT GGGT GGGGTTTC ATCTTTCT AGGGT TGAAAACT AAAC TGT CCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAAT C AG AAGC T G GGT AT AAT AT TTC AAGT T AC AAACC C T AGAAAAAT T AAAC AGT T AC T GAAAT T AT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
T TAT AGG GAAT T T GGAAGT GT AT CAAT AAAACAGTATAT AATTT T 



FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRS DGKTCEDVEGC HNNNGGCSHSCLGSEKG Y QCEC PRGLVL SEDNHTCQVPVLCKSNAI EVNT PREL VGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVWDKIVASNLVTGLPKQTPGSSGDFIIRTSECLLIPVT 
CE FP RL YT I S E G Y VPNL RN S PLE IMS RNHG I FP FTL E I FKDNE FE E P Y REAL PT LKLRDS LY FG I E P WH V 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCG VL DE RS RC AQ GC HRRMRRGAGG ED S AGLQGQT LTGG P I RI DWE D 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites . 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCTTGCT C AGCGG AC AAGGATGCTGGGCGTGAGGG ACCAAGGC CTGCCC T GCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GCCT TGGGGTGACAAT CT CAGCT CCAGGCT AC AGGGAGACCGGGAGGATC AC AGAGCCAGC ATGT 
T ACAGG AT CCTGACAGTG AT CAACCT CTG AAC AGCC TCGATGTC AAACCCCT GCGC AAACCC CGT ATCCC C 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTT GTCCTC AT CAAGGTGATTCT GG AT AAAT ACT ACTTCCT CT GCGGGC AGCCTCTC CACT T CATCCCG A 
GGAAGC AGCT GT GT GACGGAGAGC TGGACTGTCCCTTG GGGGAGGACG AGGAGCACTGTGTC AAGAGCTT C 
i s CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
|J| AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
P! GCTAC AGCAGAGCT GT GG AGAT TGGC CCAG AC CAGGAT CTGGAT GTTGTTGAAATC AC AG AAAACAGCC AG 

fll GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA 
GCCC AC TGCT TC AGGAAACATACCGATGTGTT CAACTGGAAGGT GCGGGC AGGC TCAGACAAAC T GGGCAG 
CT TCCC ATC C CT GGCTGT GGCCAAGATCATCATCATTG AAT T CAACCCCATGTACCCCAAAGAC AATG ACA 
TCGCCC TC AT GAAGCTGC AGT T CCC ACTCACT T T CTCAGGCACAGTCAGGCCCATCTGTCTGCCCTT C TTT 
GATGAGGAGC TCAC TCCAGCCACCCC ACTCTGGAT CAT TGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT 



hi 

If! 
m 



III 

13 

pj ACCAGGGGGAAG TCACCG AGAAGATG AT G T GT GC AGGCATCCCGGAAGGGGGTGTGGAC ACCTGCC AG GGT 

|3 GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
f|| CT GCGGGGGC CC GAGCACCCCAGGAGTATACACC AAGGTCTC AGCCT ATCT CAACTGGAT CT AC AATGTCT 

GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGC AAGAGTCC CCTTGGGT AC ACCCCT CT GC CC AC AGCCTC AGCAT 
T TCTTGGAGC AGCAAAGGGCCT CAATTCCTGT AAGAGACCCT CGC AGCCC AGAGGC GCCC AGAGGAAGTCA 
GCAGCCCT AGCT CGGCCACACT T GGTGCT CCC AGC ATCCCAGGGAG AG AC ACAGCC CACT GAAC AAGGTCT 
CAGGGGTATT GCT AAGCC AAGAAGGAACT T TCCC AC ACTACT GAATGG AAGCAGGC TGTCTT GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT ACTAGAGCAAGAAACC AGT T GT AAT ATAAAAT GC ACTGCCCT AC TGTTGGT ATGACTACC GTT ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 
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* •• !; 

P 
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MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPI I IALLSLAS 1 1 1 VWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEAI^TACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSIAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMS DI LLQAS VQVI DS TRCNADDAYQGEVTEKMMCAG I PEGGVDTCQGDSGGPLMYQS 
DQWH WG I VS WG YGCGGP S T PGVYTKVSAYLNW I YNWKAE L 



M Transmembrane domain: 

ill 



amino acids 32-53 (typell) 
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FIGURE 113 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACT ATGC TGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGAT TGA CAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCT TCCCTCTGAGAGGCCCT CCTATGTCCC TACTAAAGCCACCAGCAAGACATAGC TGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAGC TAC T CGGGAGGCTGAGACAGGAGAAT TACT T GAACC TGGGAGG TGAAGGAGGCTGAGACA 
GGAGAAT CACTT CAGCCTGAGCAACACAGCGAGAC TC TGTC TC AGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 
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FIGURE lid. 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE us 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTG'X'GCTGAGAGACCATGGCAA 
AGAAT CC TC CAGAGAAT TGT GAAGACTGTCACAT TCTAAAT GCAGAAGCT T T TAAAT C CAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAAT GGAGAGAAGAAGAAGAT T TACATGGAAAT T GAT CCTGTGACCAGAAC T GAAATA 
T TCAGAAGC GGAAATGGCACTGAT GAAACAT TGGAAGTGCACGACT T TAAAAACGGATACACTGG 
M CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
Jj? C T G AAC C AGAAGAGGAAAT AG AT G AG AAT GAAG AAAT T ACC AC AAC T T T C T T T GAACAG T CAG T G 

J|f ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
||| GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
.Iff ACTT T GAGGAGGAGGGAGAAGAT C T T CAC T TTC CTGC CAACGAAAAAAAAGGGATT GAACAAAAT 

*P GAACAGT GGGT GGT CC CTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGT GAGGA 

^ * AGAACTT CCAATAAAT GACTATACT GAAAATGGAATAGAAT T T GATCCCATGCT GGAT GAGAGAG 

p- t GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
Ifl GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
p TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 

111 T GCTGGCAACATAT AAT AAAT GCAT GC TAT TCAATGAAT T T C T GCCTATGAGGCAT CT GGCCCCT 

p% 

g£ GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 

TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKA.YDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 



Important features of the protein: 

tjl Signal peptide: 

^ amino acids 1-40 

fii 

HI Transmembrane domain: 

tsp amino acids 25-47 (type II) 

If! 

jjj^ N-glycosylation sites. 

lJ£ amino acids 94-97, 180-183 

III Glycosaminoglycan attachment sites. 

O amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 

IP 242 



N-myri s toyl a tion si tes . 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 



Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TAT TT C ACCATC CT GGGACT TCCAGCC AT GCT GCAGGCAGTGCGAGCCCT GAT GAT CGT AGGCATCGT CCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAGCC AACATGAC ACTGACCT CCGGGATCAT GT TCATTGTC TC AGGTCT T T GTGC AAT T GC TGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGT GC AGAC TGT TC AGACCAGGT ACAC AT T TGGTGCGGCTC T GTTCGT GGGC T GGGTCGCT GGAGGCC 
TCAC AC TAAT T GGGGGTGTGAT GATGTGCATCGCCT GCCGGGGCCTGGCACC AGAAGAAACC AACT AC AAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 

?2 TGGGT C CAACAC CAAAAACAAGAAGAT AT ACGAT GG AGGTGCCC GC AC AGAGGACGAGGT ACAATCT TATC 

Ul 

>. CT TCC AAGCACGACT ATGTGTAATGCTCT AAGACCT CTCAGCAC GGGCGGAAGAAACTCCCGGAGAGC TCA 



m 



CC CAAAAAACAAGGAG ATCC CATCT AGAT TTCTT CT TGCTTTT GACTC ACAGCT GG AAGTT AGAAAAGCCT 
g CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
|J| GCT GAGTT ATT T AT GAATTAGAGGCT AT AGCT CACATTTTCAAT C CTCTATT TCT T TTTTTAAAT AT AACT 

Iff TT CT AC TCT GAT GAGAGAAT GT GGTTTT AATC TCT C TCTC AC AT TTTGATGATT T AGAC AGACT CCC CCT C 

HI TT CCT C CT AGTC AAT AAACCCAT T GATGAT CT ATT T CCCAGCTT ATCCCCAAGAAAACTTTT GAAAGG AAA 

fll GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 



AC ACT T ACTGAAGAAG AAGCAATAAG AGAAAGAT AT TTGT AATC TCTCC AGC CC ATGATCT C GGTTTTCTT 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTT TGAGGCAACCAAACCTTT CT ACTGCTG 
T T GAC ATCTTCT T ATT ACAGCAAC ACCAT TCT AGGAGTTT CC TGAGCT CTCC AC TGGAGT C CTC T TTC TGT 
CGCGGGTC AGAAAT TGTCCCTAGATG AATGAGAAAATT ATT T TT TTT AAT T T AAGTCCT AAATAT AGTTAA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAAT G AAAAAATAATTGCT TT GACATTGTCT AT ATGGTACT TT GT AAAGT CAT GC TT AAGT ACAAATTCC 
AT GAAAAGCTCACACCTGTAATCC T AGC ACTT T GGG AGGCT GAGGAGG AAG GAT CACTTG AGCC C AG AAGT 
TC GAGACT AGCC TGGGCAAC ATGGAGAAGCCC T GTC TCT ACAAAAT AC AG AGAGAAAAAAT C AGCC AGTC A 
TGGT GGC AT ACACC TGT AGT CCCAGC ATTCCGGGAGGCTG AGGT GGGAGG AT CACT TG AGCC CAGGGAGGT 
TGGGGCTGCAGT GAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCT GT CTAAAAA 
AAT AAAAAATAAAT AATGGAAC ACAGCAAGTC CT AGGAAGTAGGT T AAAACT AAT T CT TT AA 
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FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRMMIVGIVLGAIGLLVSIF 

CAI AGVS VFANMLVTNFWMSTANMY T GMGGMVQTVQTRYT FGAAL FVGWVAGGLTL I GGVMMC IA 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 



GGAAAAAC TGTTCTCTTCTGT GGCACAGAGAAC CC T GC T T CAAAGCAGAAGT AGC AGT T CCGGAG TCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CAT CAG GAT GC AGTGCAAAAT CTAT GAT T CC CT GC T GGC T CTT TC T C C GGAC C TACAGG CAGC CAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
M C^TCACGGGC^TGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATC^TCAGAGATTTCTATA 
ACT CAATAGT GAAT GT T GCC C AAAAAC GT GAGCTT G GAGAAGCTC T C T ACT TAGGATGGACCAC G GC A 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACT CGATACCTTCCCATCGCACAACCCAAAAAAGT TAT CACACCGGAAAGAAGT CACCGAGCG 
I|| TCTACTCCAGAAGTCAGTATGTG TAG TTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 



m 



ACAAAAAT C TAT ATT AC TT T C T CAAAAT G GACCCCAAAGAAACTT T GAT T T AC T GT TCT T AAC T G CCT 
AAT CTTAATTAC AGGAACTGT GCAT CAGC TAT T TAT GATT CTATAAGCT ATTTCAGCAGAATGAGATA 
T TAAACC CAAT G CTTT G AT T GT T CT AGAAAGTATAG T AAT TTGTTTTC T AAGGT GG T T C AAGC AT CTA 
CTC TTTT TATCATTTACTT CAAAATGACAT TGCTAAAGACTGCATTAT T TTACTACT GT AATTTCTCC 
ACGACAT AGCAT T ATG T ACAT AGATGAGT GTAACAT T TAT ATC T C ACAT AGAGACAT GC T T AT AT GGT 
TTT ATTTAAAATGAAAT GCCAGT CCAT TACACT GAATAAATAGAACT CAACTATTGCTT TTCAGGGAA 
f I AT CAT GGATAGG GTT GAAGAAGGT TAC TATT AATT GT TT AAAAAC AGC T TAGGGAT T AAT GTC CTCCA 

jffj TTTATAATGAAGATTAAAATGAAGGCT TTAATCAGCATTGTAAAGGAAATTGAATGGCT TTCT GATAT 

GCT GT TTTT TAGC CTAG GAG T TAGAAATC C TAACT T CTT TATCC T CT T C T C CC AGAGGC T x TT TT TTT 
CTT GT GTATTAAATTAACATT T TTAAAAC GCAGATAT TT TGTCAAGGGGCTTT GCATTC AAAC TGCTT 
T T C CAGG G C T AT ACT C AGAAGAAAGAT AAAAGT G T GATC T AAG AAAAAG T GAT GGT T T T AG GAAAGT G 
AAAAT ATTTTT GT TTTT GTAT TT GAAGAAGAAT GAT GCAT TTT GACAAGAAATCATATATGTAT GGAT 
ATATTTTAATAAGTATT TGAGTACAGACTTTGAGGT TTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATAT TAAT AAAT TGTACATTTT TCTAAT T 



App_ID=10063546 Page 269 of 3 1 8 



FIGURE 120 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWSGLWMNCVRQANIRMQCK 
IYDSLIALSPDLQAARGLMCAASVMSFLAFM^ 

MWLI PVS WVANAI I RDFYNS I VNVAQKRELGEALYLGWTTALVL I VGGAL FCCVFCCNEKS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 



ill 



m 



m 



ill 



Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 

ft* 

|?| AAGCAGT GT TCATGGAGT TCAT TGAATTAT GGCATAGAT C T T GGGAAAAT TGCGGAGT GT ACAT T 

p TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
CP GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 



III 



ATT GAAGC T ATAAT TTAT T T GGACCAAGGAAGCCC TGAAAT GAATTCAACAATTAATAT T CATCG 
> CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
m TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
g ATTAT TGAAGAACTAC CAAA ATAAA T GCT T TAAT T T TCAT T TGC TACC TC T T TT T TTAT T ATGCC 

III T TGGAAT GGT TCAC T TAAAT GACAT T T TAAATAAGT T TAT GTATACATCT GAAT GAAAAGCAAAG 

g| C TAAATATGT T TACAGACCAAAGTG T GAT T TCACACT GT T T TT AAATC TAGCAT TAT T CAT T T TG 

C TT CAAT CAAAAGTGGT TTCAATAT T T TT T TTAGT TGGT TAGAATAC T T T CTTCATAGTCACAT T 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
||J AAAAAATATAAAAGCTACCAATCTTTGTACAAT TTGTAAATGT TAAGAAT T TTTTTTATATCTGT 

TAAAT AAAAAT TAT T T CCAACA 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIOTGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 
amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GCCT GGAAGAATACATCAT GT T T T TCGATAAGAAGAAAT T G TAGGATCCAGT TT TT TT T TTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC 
:zt CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA 
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGAC T GC CGCAACCTGGAACT T T TGGACC TGGGATATAACCGGATCCGAAGT T TAGCCAGGAATG 
*J* TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTC^ 

CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
||! CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA 



i: ~ 

m 



rii 
rii 



TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
| ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGC CC CC GACGG TGGGAGCCACAGAGCCCGGCCCAGAGACCGATGC TGAC GC CGAGCACATC TCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAG TCCCTAAAGCAAAT GAG T CCCAGCACC CAGGAAT T T T AT GTA 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGT ATGA ACCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
G G T CAT TTTCCTCT C AT AC AT AAT C AAC C CAT T GAAAT T T AAAT AC CACAAT CAAT G T G AAGC T T 
GAAC T CC GGTT TAATATAAT ACC T AT TGTATAAGACC C T T T AC TGAT T CCAT TAAT G T CGCATT T 
GT T T T AAGATAAAAC T TC T TTCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQE I PS S I SAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNH I SN I DENAFNG IRRLKEL I LS SNRI S YFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMI RLKELHLEHNQ FSKLNLALFPRLVSLQNL YLQWNK I S VI GQTMS WTWSS LQRL DL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQE FYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGTT AT CGTCT TGCGCTACTGCl'GAATGX CCG TCCCGGAGGAGGAGGAGAGGCI TT T GCCGC TG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGCT T TCTAAAGCT TT GGCAAGGAGTGACAC CCGCCAT T TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAG C C AAT C CAAC T GAC C T AGT GAAGGT T C AGAT G C AAAT GGAAGGAAAAAG GAAAC T GGAAGG 
f-n. AAAACCATT GCGAT T TCG T GGT GT ACATCATGCAT T T GCAAAAATC T TAGC TGAAGGAGGAATAC 

HI GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
III ACCACTTAT GATACAGTGAAACACTACT TGGTATTGAATACACCACTTGAGGACAATATCATGAC 

in 

TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
ill TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGAC^ 

g ACT GACT GC T TGAT TCAGGCTGT T CAAGGTGAAGGAT TCAT GAG T C TATATAAAGGCT T TT TACC 

p ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 



TGAGTGGAGTCAGTCCATT T TAA 
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FIGURE 126 



MSVPESEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMI^GVIGQFLANPTDLVKVQMQMEGKRKLEG 

NIQRAALVIQMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 



& amino acids 25-38, 130-147, 233-248 

m 

III 

la's 
a? y 

IP 
8 

P 
lit 

m 

ni 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCT GC TGAAT GTAAGAGT GCAGGCTACCCCGGGACT T TGATCCCCTACAGAT GT GACC TAT 
CAAAT GAAGAGGACATCC TCTCCATGT TC TCAGC TAT CCGT TCT CAGCACAGCGG T GT AGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATG T T CAAT G T GAACG T GCTGGCCCT C AG CAT C T G C ACAC GGGAAGC C TAC C AGT C CAT GA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACAT CAATAGCATGTCTGGCCACCGAGTGT TA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAATTCGCC TT CAAACT CCACGACAAGGACCC TGAGAAGGCAGCT GCCACC TATGAGC AA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGT CCACACCCCGACCAGGGGCTAGAAAATT TGTTTGAGATTTT TATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAG T GGAT T TCATGGT GAT CATTAAAAAAGAAAAAT CGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGME RWRDRLALVTGAS GG I GAAVARAL VQQGLKWGCARTVGN I EELAAE CKS AG YPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAG3LARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 



Important features of the protein: 
Signal peptide: 

I* it 

m amino acids 1-17 

M 

CI 

111 N-myristoylation sites. 

W amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 

*p 199-205 

m 

5 Short-chain alcohol dehyrogenase . 

CI amino acids 30-42, 104-114 

Ifi 

III 

111 
f*'l 

II 
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FIGURE 12Q 



AACI T CT ACATGGGCCT CC T GC TGCT GGIGC TCT TCCT CAGCC'l'CC IGCCGGTGGCCTACACCA1' 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTT T GGGTGAATCACCGAT GCCATT TAAGAGGGTT TTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAATTT TT AACTGATAGT TGTACATAT T TGGGGG TACATGTGATAT T TGGATACATGTATACAA 
TATATAATGAT CAAATCAGGGTAAC T GGGATAT CCAT CACATCAAACATT TAT T T T T TAT TCTT T 
TTAGACAGAGT C T CAC TCT GTCAC CCAGGCT GGAG TGCAGT GGTGCCATC TCAGCT TACT GCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
||1 AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
P AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T TCCATC CATGT TGC TGCAAAT GACAGGAT T TCGT TCT T AAT T T CAAT TAAAATAACCACACATG 



U! 



m 



fit 



GCAAAAA 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE 131 



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGGAT GCTACAACTCTCTACCTT CAGAACAACCAAAT AAAT AATGCTGGGAT T CCTTC AGAT 
TT GAAAAACT T GCT GAAAGT AGAAAGAATATACCT AT ACC AC AAC AGTTT AGAT GAATTTCCTACC AACCT 
CCCAAAGT AT GT AAAAGAGT T ACATTTGCAAGAAAATAACAT AAGG ACTATC ACTT ATGATT CACTT T CAA 
AAAT T C CCTAT C TGGAAG AATT AC ATTT AG AT GAC AACTCTGTC T CTGCAGT T AGC AT AG AAGAGGGAGC A 
1^ TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
|*| GCCC AGGACT AT AGAAGAACTACGCT TGGATGAT AATCGC AT AT CC ACTATTTC AT C ACC AT CT CTT C AAG 

ft GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 
1 1| TTCTTC AACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

III TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
0} CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
GATGAT TT GGAC AAT AT AACACAACT GATTCTTCGC AACAAT CC CTGGTATT GC GGGTGC AAGATG AAATG 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 

i% AC CAT T CAGAT AAC C ACT GCAATACC C AACAC AGT GTATCCT GC CC AAGGAC AGT GGCCAGCTC C AGTGAC 

IpI 

fn CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 

M 

r?| CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 

tn 

f*\ AC TGC T TT GAGACT C AGCTGGCTT AAACTGGGCC AT AGCCCGGCATTTGG AT C T AT AAC AGAAAC AATTGT 

HI AACAGGGGAACGCAGTGAGT ACT T GGTC ACAGCCCT GGAGCCTGATTC ACCCTAT AAAGT AT GC ATGGTTC 

CC ATGGAAACCAGCAACCTCTACC T ATTTGAT GAAACTCCTGTT TGT ATT GAGACTGAAACT GC ACCCCT T 
CGAATGTACAACCCT ACAACCACC CTCAATCGAGAGCAAG AGAAAGAACCT T ACAAAAACCCCAATTT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AAT GGATCGCTC T TCT C AAGGAACT GTGCATAT AGC AAAGGG AGGAGAAG AAAGGAT G ACT AT 
GC AGAAGCTGGC ACT AAG AAGGACAACTCT AT CCTGGAAATCAGGG AAACTT CT TTTC AGAT GT T ACC AAT 
AAGC AATG AACC CAT CTCGAAGGAGGAGTT TGTAAT AC AC ACCAT ATT TCCT CCT AATGGAATGAATC TGT 
AC AAAAAC AATC AC AGTG AAAGCAGT AGT AAC CGAAGCT ACAGAGAC AGT GGT AT T CC AGACTCAGAT CAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVS AVS I EEGAFRDSNYLRLLFL S RNHLS T I PWGLPRT IEELRLDDNRI STISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSl^NLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNWGLMCQAPEKVRGMAIKDLN^ 

IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVT I ALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 8 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites . 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE iaa 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATCTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GAC CACT CCAACAGAGCCAGGACT ATATCAACC TCT T CT GC GC CAACATGATGGACT TGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACAT CT TCATGGAAAACATCATGT TCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT TCGGGGAGCCTGAT GCTGAAGAT GAAGAAT TATC TAAAGCTAT TCAA 
TATCAGCAGCAT T T T TCGAGGAGAGT GAAGAGGCGAGAAAAACAAT T TCCAGAT T CTC GCTC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AAT TC TCT TGCC TCATCC TCCCGAG T AGC TGGGAC TACAGGAGCGTGCCACCAT ACC T GGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC TT T CAAAGAGACTGTGT TGAATAAAGGGCCAAGGTT CT TGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGC T T CCTATACAT CC T GGCAGAATAC CCCCCAGCAAACAGAGAGC CACACC CATCCACACCG 
C CACCACCAAGCAGCC GC T GAGACGGACGGTTCCAT GCCAGCT GCC TGGAGGAGGAACAGACCCC 
T TT AGTCC T CATCCC T TAGAT C C TGGAGGGCACGGAT CACATC C TGGGAAGAAGGCAT CTGGAGG 
ATAAGCAAAGCCAC CCCGACAC CC AATCT TGGAAGCCC TGAGTAGGCAGGGCCAGGG TAGG TGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE lgj. 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKAT IADL I LSALERAT VFLE QRLPE INLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANbMDLNRRMAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain : 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-rnyr i s toyl a ti on si te s . 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGAT T TGATCCCGTT CGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTT CTTACTT TAT TAAAAGGGAATCGTGGGGCT GGACAGACTT TCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATAT GC TGAAT TCCAACCATGAGT TGCCT GAT 
G T T TCT GAGT TCAT GACAAGAC TCTTCTCT TCAAAAT CATC T GGCAAATC TAGCAGCGGC AGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACAC TGGG TGGCAT CCAAG TCT TGGAA&ACCG T G TGAAGCAACT ACTATAAACT T 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AAC TT TT TAGCACAT GT T TT GT ACT T GGTACACGAGAAAACCCAGC T T TCATCT T T TGTCTGTAT 
GAGGT CAAT AT TGATGTCACTGAAT TAAT TACAGTG T CC TATAGAAAATGCCAT TAAT AAAT TAT 
ATGAACT ACTATACAT TATGTAT AT TAAT TAAAACATCT TAAT CCAGAAAT CAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDI PS GS YWEWS PAYRFDPVRVD I TSKGKMRARYVNYIKTS E 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFI^PMV^^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



r I 

til 
fn 

?r>s 
• r- 



Important features of the protein: 
Signal sequence: 

amino acids 1-23 



P 

III Transmembrane domain: 



amino acids 161-182 



hi 
IM 

Sfjf'lK 

||l N-glycosylation site. 

- amino acids 184-187 



Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 



Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCT GT G T GATCGCCACAAAC C T TCAGGAAATACGAAATGGAT T T TC TGAGATACGGGGCAGTG 
TGCAAGC CAAAGATGGAAACAT TGACAT CAGAATC T T AAGGAGGAC T GAG T C TT TGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAAC T AC CAGACCCCTGACCAT TATACT C TCCGGAAGAT CAGCAGCC T CGCC AAT TCC TTT C TTA 
CCATCAAGAAGGACC TCCGGCT C TC T CATGCCCACAT GACATGCCATT GT GGGGAGGAAGCAATG 
AAGAAATACAGCCAGAT TC T GAG TCACTT TGAAAAGCTGGAACC T CAGGCAGCAGT T GTGAAGGC 
TTT GGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGT GATGCTG 
C TG C T AAGAAT AT T CGAGG T CAAGAGC T C CAGT C T T CAAT ACC T GC AGAGGAGGCAT GAC C C CAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
ATGTATTTATTTTT TTACTTGGACATGAAACT TTAAAAAAATTCACAGATTATATTTATAACCTG 
ACTAGAGCAGGTGATG TAT TT T TATACAGTAAAAAAAAAAAACCT TGTAAAT T CTAGAAGAGTGG 
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
AT T TGAAAT TGAAC CAATGACTACT TAGGATGGGTTGTGGAATAAGT T TT GATGTGGAATTGCAC 
ATCTACC T TACAAT TACTGACCATCC CCAGTAGACTCCCCAGT CCCATAAT TG T GTAT CTT CCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTS FDI SPEMS FS I YSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEI F 

SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLMTPSTGLKTLNLGSCVIATNLQEIRNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 

SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDI^ 

E 



m 



ffl 

o 



□ 



Important features of the protein: 
Signal peptide: 

amino acids 1-42 



||| oAMP- and cGMP- dependent protein kinase phosphorylation sites. 

Ill amino acids 192-195, 225-228 



N-myristoylation sites. 

amino acids 42-47, 46-51/ 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACGTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAG T C T T T TAT GGAGATGAAACCT GGGT TAAAT TA 
T TCCCAAAGCAT T T T G TGGAAT ATGATGGAACAACCTCAT T TT TCG T G TCAGAT TACACAGAGGT 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCT GAGCGAGATGGACAGCGT GCT GAT GAAGATCCACACCT CAC T GCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCT CCACC GAGG AGG T GAAT ACACC T C T GAT T T TAAT CAG T T C T GC GT T T GAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
ACT TGGCT TACCGAT TCCAAAAGACAGTGTAGGGAGC CT CCTAT TCCCAGT TGTGGAAGGAAGAC 
CAATGAGAGAGCAGT TGAGATTTTTACATTTGAATACAGTGCAGCTTAGTAAACTGTT GCAAGAG 
AATGTGCCGTCATATGAAAAAGATCCTGGGTT TGAGCAGT T TAAAATGTCAGAAAGAT TGCATGG 
GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGC CT CAGGGAAGGACGT GGAGCAGCC T TATCC CAGGCC TCTGGGTGT CCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACT GC AGTGGCAGCAGGACT GT TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAGTAT GTAGT TACCAAAAGAATAAACGGC AATAAT TGAGAAAAAAAA 
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FIGURE mo 

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of* the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Ami da ti on site, 
amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGT T CCAGATAAAAAC T ACAT ACGCCCAGAGATCT T CT T T GCAT TAGCC TCATCCT T GAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 

f* k GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 

fl 

2; ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 

|| 

jffij G T TGGGGTGACAGATAAAT TTGAGAACAGGAAACACAT T GAAT T T TCATT TCAACCAGT TT GCAA 

||| AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 

If! AT TTGAAC T AAT TG TATAAAAACACCAAACCTGC TCACT 



p 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALAS S LS SASAEKGS P I LLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKLAAQKES ARRPFI 
FYRAQVGS WNMLESAAHPGWFI CTSCNCNEPVGVTDKFENRKHIEFS FQPVCKAEMS PSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

C TAGAGAG TATAGGGCAGAAGGAT GGCAGATGAGT GACT CCACATCCAGAGCT GCCTC CCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGT T TGCTGTCCACACAT T CAACCAACAGAGCAAGGACTAC TAT GCC T ACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAACTAGGTGT GGGAAAT T TGAAGACGACAT T GACAAC TGC CAT T T CCA&GAAAGCACAG 
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
ACACATC T T GAGCC TAAT CATGTAG T GTAGAT CAT TAAACATC AGCAT T T TAAGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



App_ID=10063546 



Page 293 



FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



C T GT GCAGCXCGAGGC TCCAGAGGCACAC T CCAGAGAGAGC CAAGGT TCTGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGC GGCGAACCAGGGGGAG TTCCAGAAGC CAGACAAC AAGC TCCACCAGCAGG T GCT CTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 



fn TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
Cft AAGCT TGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCC TGGCAAGTGACCCAGCT 

III CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 

TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 

CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 

IP 

J GAGAAGAACACATCAGGCACTGCGCCACC T GCT TCACAGTACT TCCCAACAACT C T TAGAGGTAG 

Q G T GTATT CC CGT T TTACAGATAAGGAAAC TGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCAT C 

!FI ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 

G TCCAAT T CCAGAGCTGT CT GGTGAT CAC T T TATG TCT CACAGGGACC CACATC CAAACAT GTAT 



C TCTAAT GAAATTGTGAAAGCT CC AT GT T TAGAAATAAAT GAAAACACC TGA 
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FIGURE 146 

MRKHLSWWWLATVCMLLFSHLSAV^ 

LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 



Ivtk 



Important features of the protein: 
Signal peptide: 

amino acids 1-26 



Transmembrane domain: 

III amino acids 157-171 

w 

N-glycosylation sites. 

p amino acids 98-102, 110-114 

fj| Tyrosine kinase phosphorylation site. 

Ill amino acids 76-83 

p 

as s 

p| N-myristoylation sites. 

K amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
C TGTCTT CT T T C CC TC TT TCT T C T TAT T T TAAT TAGT AGCATC TAC T CAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAAC T T T CAGAT TCAGGGGGTACATGTGAAGGT TT GT TT TATGAGTATAT T GCATGAT GC TGAGG 
TTTGGGGT 
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FIGURE 148 



MFRS SLLFWPPLCLLSLFLL I LI SS I YSESCKLE I FHFACQWGRSLSLS FYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 




N-myristoylation site. 



amino acids 62-68 



Si 
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FIGURE ldQ 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGT T GATAACTT GAAC CTGCTAAAGGCAATAACAGAAAAGGAAAAAAT TGAGAAAGAAAGACAAT CTA 
TAAGAAGCTCCCCACT T GATAATAAGT TGAAT GTGGAAGATGTTGATTCAACCAAGAAT CGAAAACTG 
ATCGATGATTAT GACTCTACTAAGAGT GGAT T GGAT CATAAATTT CAAGATGATCCAGATGGTCTTCA 
m TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
III AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC 
111 CAAGCACATACACTGGAAGAT GAAGTAGCAGAGGTT TTACAAAAATTAATCTCAAAGGAAGCCAACAA 

T TAT GAG GAGG AT CCC AATAAGC CC ACAAGC T GGAC T GAGAAT C AGG C T GGAAAAATAC C AGAGAAAG 
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
!U TTAACCT T GACAAATGGCT TGGAAAGGAGAACTAAAACCT ACAGT GAAGACAACT TTGAGGAACTCCA 

| s| ATATTTCCCAAA.TTTCTATGCGCTACTGAAAAGTAT TGAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 

fi CAC TGAT TACT AT CAT GAAAACACTGATTGACT TTGTGAAGATGAT GGT GAAATATGGAACAATATCT 

HI CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
CI AGAA?U^yyiTGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
f ** CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 

GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAA.GATGAGAGACTTCAT CAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTT GACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATT TAT AGCAGCCTGTAAAAAT GGCAAAAGAT CCAGGAGTCT T T CAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAAT TCTGT GATTAAAATTTTTT GACCCAAGG 
GTTATTAGAAAGT GCTGAATT TACAGT AGT TAACCT TTTACAAGT GGT TAAAACATAGC TTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 



MGFLGTGTWILVLYLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKIEKERQS IRS S PLDNKLNVE DVDS TKNRKL I DDYDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS I DSEKEAKEKETLI T IMKTL IDFVKMMVKYGT I SPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAI RKNI E WLKKHDKKGNKE DYDLSKMRDF INKQADAYVEKGI LDK 
EEAEAIKRIYSSL 

ili N-glycosylation sites: 

&?* 

|l| amino acids 68-71, 346-349, 350-353 



Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

*% 220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 

J! 341, 369-372, 382-385, 386-389, 387-390 



Mf N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE isi 



5™ 

m 



C GGCT CGAGGCT CC CGCCAGGAGAAAGGAACAT TC TGAGGGGAG T C TACACCCT GT GGAGC TCAA 
G ATGG TCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAAT AACCAGC TT CTAGCTGGAGGGCT GCAT GCAGGGAAGGT CAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGT GCCTGTCATGTGGGGTGGGGCAGGAGCCGACT CTAACACTAGAGCCAGT GAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
PS. CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
-1? CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
M GGGTCAGTAGCT CTCCACATGAAGT C CT G TCAC T CAC CAC T GT GCAGGAGAGGGAGG TGGTCATA 

||| GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
Iff TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 

GAAAT GGC T CGAGC T CAGAAGATAAAAGATAAGTAGGGT AT GC TGAT CCTC T T TTAAAAACCCAA 
GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
AGACCTAC TTACAAAG TGGCATATAT TGCAAT T TATT TT AATT AAAAGATACCTATT TATATATT 
p TCT T TATAGAAAAAAG TC TGGAAGAGTT TAC T TCAAT TGTAGCAATGTCAGGGT GGTGGCAGTAT 

f*| AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
|P AT T CCT T G TATAAAAATAAGAAAAGAAAT TAATCT TGAGG T AAGCAGAGCAGACATCATCTCTGA 

5* TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
M T TGTAGTAGTGATCAGGAAACAGAT CTCAGCAAAGCCACTGAGGAGGAGGCTGT GCTGAGT T TGT 

f : f| GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
g|. GAT CACAGCCCC TGGGAT T CCAAGGCAT T GGATCCAGTCT CTAAGAAGGCT GC T GTAC T GGTTGA 

ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGAT GAAGGCAGAGATC GGAGT TT T GCAGCCACAAGCTAAGAAACACCAAGGAT T GT GGCAACC 
ATCAGAAGC T TGGAAGAGGCAAAGAAGAAT TCT T CCC TAGAGGCT TTAGAGGGATAACGGCTC TG 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT TGG T TACAGCAGC T CTAGGAAACTAATACAGC TGC TAAAATGAT CCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
T AGAAT AT G G CAGAAG T GAT GG CAT G C CAC T T C CAAGAT T AGGT T ATAAAAGAC AC T GCAGC T TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACAT TAGTGAACC TAGAAGCAGAGACT CT G T GAGATAATC GATGT TTGTTGTTTT 
AAGTTGCTCAGTTTTGGTCTAACTTGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 1S2 

MVLS GALCFRMKDSALKVliYLHMQLl^GGLHAGKVI KGEE I SWPNRWLDASLS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPI TDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature . 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 



C T T CAGAACAGG T T CTCC T T C C C C AGTCACCAG T T G C TC GAG T T AGAAT T GT C T G CA ATGG CCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
C-AACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
|?k TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT 
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACTTTCTAAGCATAGATAT TTATTGATAACATT TCAT TGTAACT GGTGTTC 
TAT AC AC AGAAAAC AAT T TAT T T T T TAAAT AAT TGTCTTTT TC CATAAAAAAGAT T AC T T T CC AT 
TCC T T TAGGGGAAAAAACCCCTAAATAGC TTCATGT T TCCATAATCAGTACT T TAT AT T TAT AAA 
TGTAT TTATTATTATTATAAGACTGCATTTTAT TTATATCATT TTATTAATATGGATTTAT TTAT 



ft- 

hi 



Ejf 



AGAAACATCAT TCGATAT TGC TACT TGAG TG TAAGGCTAATAT TGATAT T TATGACAATAAT TAT 



III AGAGCTATAACATGT TTAT TTGACC TCAATAAACACT TGGATAT CCC 

Q 
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FIGURE 



MAALQKSVSS FLMGTLATSCLLLLALLVQGGAAAPI SSHCRLDKSNFQQPYI TNRT FMLAKEASL 
ADNNTDVRLIGSKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 




N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 



N-myristoylation sites. 

amino acids 14-20, 82-88 



Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT TAGG T GAGGACAGTT C TC TCAT TAGC CTT T T CC TACAGGT GGT TGCAT 
TCTT GGCAAT GG TCAT GGGAACCCACACC TACAGCCACT GGCCCAGC TGC TGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGT6TGTGTGTGCGGCCCCGT 
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACATGGATGATTC TGAGGAGGAAGCTGTTAT TGAATGTATAGAGATTTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE is6 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 



Important features of the protein: 
Signal peptide: 

amino acids 1-32 



N-glycosylation site. 



□ 

m 

m amino acids 136-140 



I?!' Tyrosine kinase phosphorylation site. 

3* amino acids 127-135 

III 

f*$ N-myristoylation sites. 

pi amino acids 44-50, 150-156 

P 
III 
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FIGURE 157 



h0 

ill 

13 

fii 

fit 



CCGGCG ATG TCGCTCG TGCT GCTAAGCC TGGCCGCGC T GTGCAGGAGCGCCG TACCCC GAGAGCC 
GACCGTTCAATGTGGCTCTGA?\ACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
C CGGAGACT TGAGGGACC TCCGAGTAGAACCTG T TACAACT AGTGTTGCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTT GT GTGACGGGCAAAAGCAACT TCCAGTCCTACAGCT GT GTGAGGTGCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCT GAACACAGT CTATT T CAT T GGGGCCCATAATAT T CC TAAT GCAAATATGAATGAAGATGG 
CCCT T CCATGTC TGTGAAT T TCACC T CACCAGGCTGC C T AGAC CACATAATGAAAT AT AAAAAAA 
AGT G T GT CAAGGCCGGAAGCCT G TGGGATCCGAACAT CACT GCT TGTAAGAAGAAT GAGGAGACA 
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAAT TT G T TAC TTCACT GAAT T TCT TCAAAACCAT T GCAGAAGTGAGGT CAT CCTTGAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAG T GCAGTGGC T TGCCAC TCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
G TGAT CT AAGAAGCCAGAT T CATC TGCACAAA.TACG T GGT GGTCTACT T T AG AG AG AT T GAT AC A 
AAAGACGAT TACAATGCTCTCAGTGT CTGCCCCAAGTACCACCTCATGAAGGAT GCCACTGCT TT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE is8 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEIXSPSMSWETSPGCLDHIMK^ 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWLVAGIYLMWRHERIKEeTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWY FREIDTKDDYNALSVCPBCYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 



App_ID=10063546 



Page 308 



FIGURE 15Q 



lif 
If! 



m 

1*1 



AGCCACCAGCGCAA CATGA CAGTGAA.GACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCA7\AGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAAT T AC ACT GT CAC T T G G GAC CCCAACC GGTACC CC T C G GAAG T T G T ACAGG C CC AG TGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACT CAGCTGAAGAAG 
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FIGURE 160 



MTVKTLHGPi^MVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 



n 

Si'! 1 

01 



5-5? 



I'll 
P 



Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 



||| N-rayristoylation sites. 

^ amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAlGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGAT GAGGAAAAGT TTGGAGGAGCAGCT GACT CAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
n T GCT T CGAGGCT GCCC TAGGGAGT GAGGTACGAATCT GGTC CT ATAC TCAGCCCAGGTACGAGAA 

GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
IP GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
||J CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
lit CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
1« CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
g AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
|*| CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
m G TGGC TGTACT T CACTAC CCAGCAAAGCC TCCACGAGGGCAGCTCGCC TT GGAGAGTACTTACTA 

CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
W CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
r|| CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
p CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
ig* CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 



1 



CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAATAAAGGCAGACGCTGTTT TTCTAAAAAAA 
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FIGURE 162 



MP VPWFLLSL ALGRS P WLS LE RL VG PQ DAT H C S PGL S CRLWDS DI LCL PGD I VPAPG P VLAPT HLQT EL V 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEECELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQS WLLD APC S LP AE AALCWRAPGG DPCQ PL VP PL SW EN VT VDKVLE F PLL KGH PNLC VQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY I H KRWALVWLACLL FAAALSL I LLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLL Y SADDSGFE RLVGALASALCQLPLRVAVDLWS RRELSAQG PVAW FHAQ RRQTLQEGGWVLL FSP 
pi GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 

LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 



||| Signal sequence: 

HI amino acids 1-20 



III 

P 



Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-58 6 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-iriyristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



□ 



1*1 

fit 

m 



in 



GGGAGGGCTCXGTGCCAGCCCCG ATGA GGACGC TGCT GACCAT C T T GACTGTGGGAT C CCTGGC T 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAACATCCTGACGTGGGACAGCGGGC CAGAGGGCACC CCAGACACGGTCTACAGCAT CGAGTATA 

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

G T CAGT GCGGGAGGCCGG TCAGCCAC CAAGAT GAC T GACAGGT TCAGC TC TCTGCAGCACACTAC 

C CTCAAGCCACCTGATGT GACCTGTATC T CCAAAGT GAGATCGAT TCAGAT GAT TGT T CATCC TA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T TCTACCACT TAGAGCT C CAGGTCAACCGCACC TACCAAATGCACC T TGGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

C CTGGGC CAAGGAGAGT GCC CCCT ACATG T GCCGAG T GAAGACACTGCCAGACC GGACAT GGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCCTGGGGAT T TGCACAGACAGAACATCT GACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 



M ; l CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
f*| CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
i|| TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCi^GAGCCCAGCCCCTGAGACCTCAGACCTG 
1% GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
M AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
fU ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCT GGAAAAGAAC CAGGCCT GGAAAAGAACCAGAAGGAGGC TGGGCAGAACCAGAAC AACC TGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCC T T T TCT GCAGGCAGGAGTT TC AGACCC T 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATC T GTAAT G T GGGGATCATAACACC TAC CT CATGGAG T TGT GG TGAAGATGAAATGAAGT CATG 
TCT T TAAAGTGC TTAATAGT GCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTAT TT AAAAA 
AAAAAMA 
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FIGURE 164 



ff** 

pi 
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r* 



r s :i 



Q 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAECKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS I QMI VHPT PT P I RAGDGHRLTLED I FHDLFYHLELQVNRTYQMHLGGKQRE YE FFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYS FSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 



Transmembrane domain. 

f% amino acids 233-250 



N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16* 



TGGCCmCTGGAAAAAAA/yWWUW^ 

CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGT GT TGAACACAG T T T TGGATAT T T TCCAAAAGAT T TGAT CAAGG TACT TCATAAATACACGGA 
AGAAGAGC T ACATAT TCCAGCAGATGAGACAGACT T T GT C TGCT TTGAAGGAGGAAGAGATGAT T 
* fe T TAATAGT T ATAAT GTAGAAGAGCT T T TAGGATCT T T GGAACT GGAGGAC TCT GTACCTGAAGAG 

*f TCGAAGAAAGCTGAAGAAGT T T CT CAGCACAGAGAGAAAT CTCCTGAGGAGTCT CGGGGGCGTGA 



m 



ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 

||| CAGAGAGCACCGAGGGGC TGCAGGGACAGCCCTCAGC TCAGGAGAGCCACCC T CACAC CAGCGGT 

If] CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 

*P ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 

m 

B AGACAGATGC T TACAAAGTCCT GAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGT TAT T 

f? l CAT TACAGC AAAGGATT TCGT T GGCATCAAAATCTAAGT TT GT T T TACAAAGAT TGT T TT TAG TA 

HI C TAAGCT GCCT TGGCAGTTT GCAT T T TTGAGCCAAACAAAAAT ATAT TAT T T T CCCT T CTAAGTA 

CI AAAAAAAAAAAAAAAAAAAA. 
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FIGURE 166 



MAAAPGLLFWLFVXGALWWPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGS LELWAGS VEHS FGYFPKDL I KVLHKYTEEELHI PADE TDFVC FEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFFADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 



m 



HI 



Important features of the protein: 
Signal peptide: 

amino acids 1-22 



|l| N-glycosylation site. 

ill amino acids 294-298 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6. 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCC^^CGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGT TAATAGAGAGCAT GTT GCAAAAC C TGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGC CT GC CC AGATC TC TACCAGTGG T C TGATGGAAGCAAT T CCCAGTACC GAAACT GGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
C TT GGGGG T CCC TACC TTTACCAGTGGAATGATGACAGGT GTAACATGAAGCACAAT TAT AT TTG 
CAAGT AT GAACCAGAGATTAAT CCAACAGCCCC TGTAGAAAAGCCT TATC T TACAAAT CAACCAG 
GAGAC AC C CAT CAGAAT GTGGTTGT T AC T GAAG CAG G TAT AAT T C C CAAT C T AAT T TAT GT T G T T 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAG T CTACACT GTGGAT TTCAAAGAGTA 
CCAGAAAAGAAAGT GGCATGGAAGTATAATAAC TCAT TGACT T GGT TCCAGAAT T T TGTAAT TC T 
GGAT CTGTATAAGGAATGGCAT CAGAACAATAGC TTGGAAT GGC T TGAAATCAC AAAGGATC TGC 
AAGAT GAACTGTAAGC TCCCCCT TGAGGCAAATATTAAAG TAAT TT TTATATGT CTAT TAT T TCA 
T TT AAAGAATATGC TGTGCT AATAAT GGAGTGAGACATGC T TAT TT TGCTAAAGGAT GCACCCAA 
ACT TCAAACT TCAAGCAAAT GAAATGGACAATGCAGATAAAGT TGT TATCAACACGT CGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
T GTAT AT TGTAT TGAAATT T ACAGTGTGCAAAAGTAT TT TACC T T TGCATAAGT G T T T GATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACT TAT T AC T GT TGTCAAC T GAAT TCACACACACACAAATATAG TACCATAGAAAAAGT TTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AAT AAGAAGCTATT TCAT TAAGTGT GATATAAACCTC CT CAAACAT T T TAC T TAGAGGCAAGGAT 
T GT CT AAT T TCAAT TG TGCAAGACAT GTGCCT TATAATT AT TT T TAGC TTAAAATTAAACAGAT T 
T TGTAAT AATGTAACT TTGT TAAT AGGTGCATAAACACT AATGCAGTCAATT TGAACAAAAGAAG 
TGACATACACAATATAAATCAT AT G T C T TCACACGT T GC C T AT ATAATGAGAAGCAGCTCT CTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T T AGGAAGGAAAGGAAC TACGAAATC GTGTGAAAAT GGG T TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAMAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVS FQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
P YLTNQPGDTHQNVWTEAG 1 1 PNL I YWI PT I PLLLL I LVAFGTCC FQMLHKSKGRTKTS PNQ 
STLWISKSTRKESGMEV 



Important features of the protein: 
Signal peptide: 

amino acids 1-21 



I?* 

JJ* Transmembrane domain: 

hi 



amino acids 214-235 



N-glycosylation sites. 



"t* amino acids 86-89, 255-258 

m 



pi cAMP- and cGMP-dependent protein kinase phosphorylation site. 



it% amino acids 2 66-269 



FS5. 

Hf N-myristoylation sites. 



amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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